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For our parents 



Preface 



This book explains a result called the Modularity Theorem: 



All rational elliptic curves arise from modular forms. 



Taniyama first suggested in the 1950's that a statement along these lines might 
be true, and a precise conjecture was formulated by Shimura. A paper of Weil 
[Wei67] provides strong theoretical evidence for the conjecture. The theorem 
was proved for a large class of elliptic curves by Wiles [Wil95] with a key 
ingredient supplied by joint work with Taylor [TW95] , completing the proof 
of Fermat's Last Theorem after some 350 years. The Modularity Theorem 
was proved completely by Breuil, Conrad, Taylor, and the first author of this 
book [BCDT01]. Different forms of it are stated here in Chapters 2, 6, 7, 8, 
and 9. 

To describe the theorem very simply for now, first consider a situation 
from elementary number theory. Take a quadratic equation 



The values a p (Q) extend multiplicatively to values a n (Q) for all positive in- 
tegers n, meaning that a mn (Q) = a m (Q)a n (Q) for all m and n. 

Since by definition a p (Q) is the Legendre symbol (d/p) for all p > 2, one 
statement of the Quadratic Reciprocity Theorem is that a p (Q) depends only 
on the value of p modulo A\d\. This can be reinterpreted as a statement that 
the sequence of solution-counts {d2 (Q), as(Q), a$(Q), . . .} arises as a system 
of eigenvalues on a finite-dimensional complex vector space associated to the 
equation Q. Let N — 4|d|, let G = (Z/NZ)* be the multiplicative group of 



Q : x 2 = d, 



deZ, d^O, 



and for each prime number p define an integer a p (Q), 




the number of solutions x of equation Q 
working modulo p 
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integer residue classes modulo N, and let Vn be the vector space of complex- 
valued functions on G, 

V N = {f:G^ C}. 
For each prime p define a linear operator T p on Vn, 

where the product pn £ G uses the reduction of p modulo iV. Consider a 
particular function / = /q in Vjv, 

/ : G — > C, /(n) = a„(Q) for n e G. 

This is well defined by Quadratic Reciprocity as stated above. It is immediate 
that / is an eigenvector for the operators T p , 



(T P f)(n) 



= \f(pn) = a pn (Q) = a p (Q)a n (Q) if p\N, 
[0 if p | N 

= a p (Q) f (n) in all cases. 



That is, T p f — a p (Q)f for all p. This shows that the sequence {a p (Q)} is a 
system of eigenvalues as claimed. 

The Modularity Theorem can be viewed as giving an analogous result. 
Consider a cubic equation 

E : y 2 - 4x 3 - g 2 x - g 3 , g 2 , g 3 e Z, gl - TJg\ ± 0. 

Such equations define elliptic curves, objects central to this book. For each 
prime number p define a number a p (E) akin to a p (Q) from before, 

(the number of solutions (x, y) of equation E\ 
working modulo p J 

One statement of Modularity is that again the sequence of solution-counts 
{a p (E)} arises as a system of eigenvalues. Understanding this requires some 
vocabulary. 

A modular form is a function on the complex upper half plane that sat- 
isfies certain transformation conditions and holomorphy conditions. Let t be 
a variable in the upper half plane. Then a modular form necessarily has a 
Fourier expansion, 

oo 

/(r) = a n (f)e 2mT , a n (f) G C for all n. 

Each nonzero modular form has two associated integers k and N called its 
weight and its level. The modular forms of any given weight and level form 
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a vector space. Linear operators called the Hecke operators, including an op- 
erator T p for each prime p, act on these vector spaces. An eigenform is a 
modular form that is a simultaneous eigenvector for all the Hecke operators. 
By analogy to the situation from elementary number theory, the Modular- 
ity Theorem associates to the equation E an eigenform / = /e in a vector 
space Vn of weight 2 modular forms at a level N called the conductor of E. 
The eigenvalues of / are its Fourier coefficients, 

Tp{f) = a P (.f).f for all primes p, 

and a version of Modularity is that the Fourier coefficients give the solution- 
counts, 

a p(.f) = cl p {E) for all primes p. (0.1) 

That is, the solution-counts of equation E are a system of eigenvalues, like the 
solution-counts of equation Q, but this time they arise from modular forms. 
This version of the Modularity Theorem will be stated in Chapter 8. 

Chapter 1 gives the basic definitions and some first examples of modular 
forms. It introduces elliptic curves in the context of the complex numbers, 
where they are defined as tori and then related to equations like E but with 
52,53 € C. And it introduces modular curves, quotients of the upper half 
plane that are in some sense more natural domains of modular forms than the 
upper half plane itself. Complex elliptic curves are compact Ricmann surfaces, 
meaning they are indistinguishable in the small from the complex plane. Chap- 
ter 2 shows that modular curves can be made into compact Riemann surfaces 
as well. It ends with the book's first statement of the Modularity Theorem, 
relating elliptic curves and modular curves as Riemann surfaces: If the com- 
plex number j — 1728<?f/(gf — 27<7|) is rational then the elliptic curve is the 
holomorphic image of a modular curve. This is notated 

X 0 (N) — ► E. 

Much of what follows over the next six chapters is carried out with an eye 
to going from this complex analytic version of Modularity to the arithmetic 
version (0.1). Thus this book's aim is not to prove Modularity but to state its 
different versions, showing some of the relations among them and how they 
connect to different areas of mathematics. 

Modular forms make up finite-dimensional vector spaces. To compute their 
dimensions Chapter 3 further studies modular curves as Ricmann surfaces. 
Two complementary types of modular forms are Eisenstein series and cusp 
forms. Chapter 4 discusses Eisenstein series and computes their Fourier ex- 
pansions. In the process it introduces ideas that will be used later in the book, 
especially the idea of an L-function, 

oo 

L{s) = yan 
w ^ n s 
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Here s is a complex variable restricted to some right half plane to make the 
series converge, and the coefficients a n can arise from different contexts. For 
instance, they can be the Fourier coefficients a n (f) of a modular form. Chap- 
ter 5 shows that if / is a Hecke eigcnform of weight 2 and level N then its 
L-function has an Euler factorization 

L(s, f) = [](! - a P (f)p- s + 1n{p)P 1 - 2s )- 1 - 

p 

The product is taken over primes p, and 1n(p) is 1 when p \ N (true for all 
but finitely many p) but is 0 when p \ N. 

Chapter 6 introduces the Jacobian of a modular curve, analogous to a 
complex elliptic curve in that both are complex tori and thus have Abelian 
group structure. Another version of the Modularity Theorem says that every 
complex elliptic curve with a rational j-value is the holomorphic homomorphic 
image of a Jacobian, 

J Q (N) — > E. 

Modularity refines to say that the elliptic curve is in fact the image of a 
quotient of a Jacobian, the Abelian variety associated to a weight 2 eigcnform, 

A f — > E. 

This version of Modularity associates a cusp form / to the elliptic curve E. 

Chapter 7 brings algebraic geometry into the picture and moves toward 
number theory by shifting the environment from the complex numbers to the 
rational numbers. Every complex elliptic curve with rational j-invariant can 
be associated to the solution set of an equation E with 32,33 € Q. Modular 
curves, Jacobians, and Abelian varieties are similarly associated to solution 
sets of systems of polynomial equations over Q, algebraic objects in contrast 
to the earlier complex analytic ones. The formulations of Modularity already 
in play rephrase algebraically to statements about objects and maps defined 
by polynomials over Q, 

A o (A0ai g — ► E, Jo(A0ai g — > E, A fMe — ► E. 

We discuss only the first of these in detail since Xq{N) & \ & is a curve while 
Jo(A0aig an d -<4/,aig are higher-dimensional objects beyond the scope of this 
book. These algebraic versions of Modularity have applications to number 
theory, for example constructing rational points on elliptic curves using points 
called Heegner points on modular curves. 

Chapter 8 develops the Eichler-Shimura relation, describing the Hecke 
operator T p in characteristic p. This relation and the versions of Modularity 
already stated help to establish two more versions of the Modularity Theorem. 
One is the arithmetic version that a p (f) = a p (E) for all p, as above. For the 
other, define the Hasse-Weil L-function of an elliptic curve E in terms of the 
solution-counts a p (E) and a positive integer TV called the conductor of E, 
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L(s, E) = H(l - a p (E)p- s + l N {p)p 1 - 2s )-\ 

p 

Comparing this to the Euler product form of L(s, f) above gives a version of 
Modularity equivalent to the arithmetic one: The L-function of the modular 
form is the L-function of the elliptic curve, 

L(s,f)=L(s,E). 

As a function of the complex variable s, both L- functions arc initially defined 
only on a right half plane, but Chapter 5 shows that L{s, /) extends ana- 
lytically to all of C. By Modularity the same now holds for L(s,E). This is 
important because we want to understand E as an Abclian group, and the 
conjecture of Birch and Swinncrton-Dyer is that the analytically continued 
L(s, E) contains information about the group's structure. 

Chapter 9 introduces l-adic Galois representations, certain homomor- 
phisms of Galois groups into matrix groups. Such representations are asso- 
ciated to elliptic curves and to modular forms, incorporating the ideas from 
Chapters 6 through 8 into a framework with rich additional algebraic struc- 
ture. The corresponding version of the Modularity Theorem is: Every Galois 
representation associated to an elliptic curve over Q arises from a Galois 
representation associated to a modular form, 

Pf,t ~ Pe.i- 

This is the version of Modularity that was proved. The book ends by discussing 
two broader conjectures that Galois representations arise from modular forms. 

Many good books on modular forms already exist, but they can be daunt- 
ing for a beginner. Although some of the difficulty lies in the material itself, 
the authors believe that a more expansive narrative with exercises will help 
students into the subject. We also believe that algebraic aspects of modu- 
lar forms, necessary to understand their role in number theory, can be made 
accessible to students without previous background in algebraic number the- 
ory and algebraic geometry. In the last four chapters we have tried to do so 
by introducing elements of these subjects as necessary but not letting them 
take over the text. We gratefully acknowledge our debt to the other books, 
especially to Shimura [Shi 73]. 

The minimal prerequisites are undergraduate semester courses in linear al- 
gebra, modern algebra, real analysis, complex analysis, and elementary num- 
ber theory. Topics such as holomorphic and meromorphic functions, congru- 
ences, Euler's totient function, the Chinese Remainder Theorem, basics of 
general point set topology, and the structure theorem for modules over a 
principal ideal domain are used freely from the beginning, and the Spectral 
Theorem of linear algebra is cited in Chapter 5. A few facts about represen- 
tations and tensor products are also cited in Chapter 5, and Galois theory is 
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used extensively in the later chapters. Chapter 3 quotes formulas from Rie- 
mann surface theory, and later in the book Chapters 6 through 9 cite steadily 
more results from Ricmann surface theory, algebraic geometry, and algebraic 
number theory. Seeing these presented in context should help the reader ab- 
sorb the new language necessary en route to the arithmetic and representation 
theoretic versions of Modularity. 

We thank our colleagues Joe Buhler, David Cox, Paul Garrett, Cris Poor, 
Richard Taylor, and David Yuen, Reed College students Ashcr Auel, Rachel 
Epstein, Harold Gabel, Michael Lieberman, Peter McMahan, and John Sailer, 
and Brandeis University student Makis Dousmanis for looking at drafts. Com- 
ments and corrections should be sent to the second author at j erryOreed . edu. 



July 2004 Fred Diamond 

Brandeis University 
Waltham, MA 



Jerry Shurman 
Reed College 
Portland, OR 
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Modular Forms, Elliptic Curves, and Modular 
Curves 



This chapter introduces three central objects of the book. 

Modular forms are functions on the complex upper half plane. A matrix 
group called the modular group acts on the upper half plane, and modular 
forms are the functions that transform in a nearly invariant way under the 
action and satisfy a holomorphy condition. Restricting the action to subgroups 
of the modular group called congruence subgroups gives rise to more modular 
forms. 

A complex elliptic curve is a quotient of the complex plane by a lattice. 
As such it is an Abelian group, a compact Riemann surface, a torus, and — 
nonobviously — in bijective correspondence with the set of ordered pairs of 
complex numbers satisfying a cubic equation of the form E in the preface. 

A modular curve is a quotient of the upper half plane by the action of a 
congruence subgroup. That is, two points are considered the same if the group 
takes one to the other. 

These three kinds of object are closely related. Modular curves are mapped 
to by moduli spaces, equivalence classes of complex elliptic curves enhanced 
by associated torsion data. Thus the points of modular curves represent en- 
hanced elliptic curves. Consequently, functions on the moduli spaces satisfying 
a homogeneity condition are essentially the same thing as modular forms. 

Related reading: Gunning [Gun62], Koblitz [Kob93], Schoeneberg [Sch74], 
and Chapter 7 of Serre [Ser73] are standard first texts on this subject. For 
modern expositions of classical modular forms in action see [Cox84] (reprinted 
in [BBBOO]) and [Cox97]. 

1.1 First definitions and examples 

The modular group is the group of 2-by-2 matrices with integer entries and 
determinant 1, 
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The modular group is generated by the two matrices 



1 1 
0 1 



and 



(Exercise 1.1.1). Each element of the modular group is also viewed as an 
automorphism (invertiblc self-map) of the Riemann sphere C = CU{oo}, the 
fractional linear transformation 



a b 

c d 



(r) 



ar - 



d 



t e c. 



This is understood to mean that if c ^ 0 then — d/c maps to oo and oo 
maps to a/c, and if c = 0 then oo maps to oo. The identity matrix I and 
its negative —I both give the identity transformation, and more generally 
each pair ±7 of matrices in SL 2 (Z) gives a single transformation. The group 
of transformations defined by the modular group is generated by the maps 
described by the two matrix generators, 

t^tII and t 1 ^ 1 J t . 

The upper half plane is 

n = {reC: Im(r) > 0}. 

Readers with some background in Riemann surface theory — which is not nec- 
essary to read this book — may recognize H as one of the three simply con- 
nected Riemann surfaces, the other two being the plane C and the sphere C. 
The formula 



Im(7(Y)) = 



Im(r) 

\cT+df 



7 = 



a b 
c d 



G SL 2 (Z) 



(Exercise 1.1.2(a)) shows that if 7 e SL 2 (Z) and r <G H then also 7(7-) G H, 
i.e., the modular group maps the upper half plane back to itself. In fact the 
modular group acts on the upper half plane, meaning that I(t) = r where 
I is the identity matrix (as was already noted) and (77') (r) = 7(7' (r)) for 
all 7,7' e SL 2 (Z) and r € %. This last formula is easy to check (Exer- 
cise 1.1.2(b)). 

Definition 1.1.1. Let k be an integer. A meromorphic function f : % — > C 
is weakly modular of weight k if 



fh(T)) = (cT + d) k f(r) for 7 = 



a b 
c d 



G SL 2 (Z) and t G H. 



Section 1.2 will show that if this transformation law holds when 7 is each 
of the generators and [""J] then it holds for all 7 G SL 2 (Z). In other 
words, / is weakly modular of weight k if 
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/(r + l) = /(r) and /(-1/r) = r k f(r). 

Weak modularity of weight 0 is simply SL 2 (Z)-invariance, / o 7 = / for 
all 7 e SL 2 (Z). Weak modularity of weight 2 is also natural: complex analy- 
sis relies on path integrals of differentials f{r)dr, and SL2(Z)-invariant path 
integration on the upper half plane requires such differentials to be invariant 
when r is replaced by any 7 (r). But (Exercise 1.1.2(c)) 

d-f(r) = (cT + d)~ 2 dT, 

and so the relation /(7(r))d(7(r)) = /(r)dr is 

/( 7 (r)) = ( C r + rf) 2 /(r), 

giving Definition 1.1.1 with weight k — 2. Weight 2 will play an especially 
important role later in this book since it is the weight of the modular form in 
the Modularity Theorem. The weight 2 case also leads inexorably to higher 
even weights — multiplying two weakly modular functions of weight 2 gives a 
weakly modular function of weight 4, and so on. Letting 7 = —I in Defini- 
tion 1.1.1 gives / = (— l) fc /, showing that the only weakly modular function 
of any odd weight k is the zero function, but nonzero odd weight examples 
exist in more general contexts to be developed soon. Another motivating idea 
for weak modularity is that while it does not make a function / fully SL 2 (Z)- 
invariant, at least /(r) and /( 7 (t)) always have the same zeros and poles 
since the factor cr + d on H has neither. 

Modular forms are weakly modular functions that are also holomorphic on 
the upper half plane and holomorphic at 00. To define this last notion, recall 
that SL 2 (Z) contains the translation matrix 



1 1 
0 1 



IT4T+1, 



for which the factor ct + d is simply 1, so that /(r + 1) = /(r) for every 
weakly modular function / : % — > C. That is, weakly modular functions are 
Z-periodic. Let D = {q e C : \q\ < 1} be the open complex unit disk, let 
D' = D — {0}, and recall from complex analysis that the Z-pcriodic holomor- 
phic map t h-> e 2mT — q takes % to D' . Thus, corresponding to /, the function 
g : D' — > C where g(q) = f (log(q) / (2iri)) is well defined even though the 
logarithm is only determined up to 2niZ, and /(t) = g(e 27TlT ). If / is holo- 
morphic on the upper half plane then the composition g is holomorphic on 
the punctured disk since the logarithm can be defined holomorphically about 
each point, and so g has a Laurent expansion g{q) = X)nez a "9™ f° r 9 £ D' . 
The relation \q\ = e - 27rIm ( r ) shows that q -> 0 as Im(r) -> 00. So, thinking 
of 00 as lying far in the imaginary direction, define / to be holomorphic at 00 
if g extends holomorphically to the puncture point q — 0, i.e., the Laurent 
series sums over n £ N. This means that / has a Fourier expansion 
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/(r) = X>„(/)g", q = e 2 ™ 

n=0 



Since g — ^ 0 if and only if Ihi(t) — > oo, showing that a weakly modular holo- 
morphic function / : % — > C is holomorphic at oo doesn't require computing 
its Fourier expansion, only showing that lim Im ( T )^ 00 /(t) exists or even just 
that /(t) is bounded as Im(r) — > oo. 

Definition 1.1.2. Let k be an integer. A function f : % — > C is a modular 
form of weight k if 

(1) / is holomorphic on H, 

(2) / is weakly modular of weight k, 

(3) / is holomorphic at oo. 

The set of modular forms of weight k is denoted .Mfc(SL 2 (Z)). 

It is easy to check that A / J/t(SL 2 (Z)) forms a vector space over C (Exer- 
cise 1.1.3(a)). Holomorphy at oo will make the dimension of this space, and 
of more spaces of modular forms to be defined in the next section, finite. We 
will compute many dimension formulas in Chapter 3. When / is holomorphic 
at oo it is tempting to define /(oo) = g(0) = a 0 , but the next section will 
show that this doesn't work in a more general context. 

The product of a modular form of weight k with a modular form of weight I 
is a modular form of weight k + I (Exercise 1.1.3(b)). Thus the sum 

X(SL 2 (Z)) = 0M fc (SL 2 (Z)) 
fcez 

forms a ring, a so-called graded ring because of its structure as a sum. 

The zero function on H is a modular form of every weight, and every 
constant function on % is a modular form of weight 0. For nontrivial examples 
of modular forms, let k > 2 be an even integer and define the Eisenstein 
series of weight k to be a 2-dimensional analog of the Riemann zeta function 

C(*) = ££=iVd fc , 

G ^£'(c7T^< Ten > 

(c.d) y ' 

where the primed summation sign means to sum over nonzero integer pairs 
(c, d) £ Z 2 — {(0,0)}. The sum is absolutely convergent and converges uni- 
formly on compact subsets of % (Exercise 1.1.4(c)), so Gk is holomorphic 
on H and its terms may be rearranged. For any 7 = [ a c h d \ € SL 2 (Z), compute 
that 
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= (cr + d ) k J2 ((c'a + d'c)T+(c'b + d'd)) k ' 

(d .d' ) 

But as (c',<f) runs through Z 2 - {(0,0)}, so does (c'a + d'c,db + d'd) = 
( C V) [cd] (Exercise 1.1.4(d)), and so the right side is (cr + d) k G k {r), 
showing that is weakly modular of weight k. Finally, G^ is bounded 
as Im(r) — > oo (Exercise 1.1.4(e)), so it is a modular form. 

To compute the Fourier series for Gk, continue to let r <G % and begin 
with the identities 

OO , s OO 

-+^2[—-, + -) =TTCotTTT^m-2TTi^2q m , q = e 2mT (1.1) 



r 



d=l x 7 m=0 



(Exercise 1.1.5 — the reader who is unhappy with this unmotivated incanting 
of unfamiliar expressions for a trigonometric function should be reassured that 
it is a standard rite of passage into modular forms; but also, Exercise 1.1.6 
provides other proofs, perhaps more natural, of the following formula (1.2)). 
Differentiating (1.1) k— 1 times with respect to r gives for r GW and q = e 27r4T , 

T T-^k = 77^ E ™ fe ~V n , k>2. (1.2) 
^(r + d) fc (fc-1! ^ y ~ v ; 

dGZ v ; v ; m=l 

For even k > 2, 

^ (cr + d) k = ^2~d k ~ IS ( cr + rf)fe ) ' 

(c,d) V ' d^O c=l Vzez v y / 

so again letting £ denote the Riemann zeta function and using (1.2) gives 

Rearranging the last expression gives the Fourier expansion 

G fc (r)=2C(fc)+2^^- j ^ t 7 fe _ 1 (n) g ", k > 2, k even 

^ n=l 

where the coefficient <Tk-i(n) is the arithmetic function 



o-fc_i(n) = E TO 



fe-i 



m|n 
m>0 
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Exercise 1.1.7(b) shows that dividing by the leading coefficient gives a se- 
ries having rational coefficients with a common denominator. This normalized 
Eisenstein series Gk(r)/(2((k)) is denoted Ek(r). The Riemann zeta function 
will be discussed further in Chapter 4. 

Since the set of modular forms is a graded ring, we can make modular 
forms out of various sums of products of the Eisenstein series. For example, 
Ais(SL 2 (Z)) turns out to be 1-dimensional. The functions E^t) 2 and E$(t) 
both belong to this space, making them equal up to a scalar multiple and 
therefore equal since both have leading term 1. Expanding out the relation 
El = E s gives a relation between the divisor-sum functions u 3 and 07 (Exer- 
cise 1.1.7(c)), 

ra-l 

<7y(n) =CT 3 (n) + 120 ^ a 3 (i)a 3 (n - i), n > 1. (1.3) 

i=l 

The modular forms that, unlike Eisenstein series, have constant term equal 
to 0 play an important role in the subject. 

Definition 1.1.3. A cusp form of weight k is a modular form of weight k 
whose Fourier expansion has leading coefficient a 0 = 0, i.e., 

00 

/(r) = 5> n( f, q = e 2 ™. 

n=l 

The set of cusp forms is denoted iSfc(SL 2 (Z)). 

So a modular form is a cusp form when lim Im ( T )^ 00 /(r) = 0. The limit 
point 00 of H is called the cusp o/SL 2 (Z) for geometric reasons to be explained 
in Chapter 2, and a cusp form can be viewed as vanishing at the cusp. The cusp 
forms 5fc(SL 2 (Z)) form a vector subspace of the modular forms Atfc(SL 2 (Z)), 
and the graded ring 

5(SL 2 (Z)) = 05 fe (SL 2 (Z)) 
kez 

is an ideal in A^(SL 2 (Z)) (Exercise 1.1.3(c)). 
For an example of a cusp form, let 

g 2 (r) = 60G 4 (r), g 3 ( T ) = 140G 6 (r), 

and define the discriminant function 

A:H^C, A(t) = ( 92 (t)) 3 - 27( 93 (t)) 2 . 



Then A is weakly modular of weight 12 and holomorphic on T-L, and do = 0, 
di = (27r) 12 in the Fourier expansion of A (Exercise 1.1.7(d)). So indeed 
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A G Si 2 (SL 2 (Z)), and A is not the zero function. Section 1.4 will show that 
in fact A(t) ^ 0 for all t G H so that the only zero of A is at oo. 
It follows that the modular function 

j '■ U — > C, j(r) = 1728^^ 

is holomorphic on "H. Since the numerator and denominator of j have the 
same weight, j is SL 2 (Z)-invariant, 

i(7(r))=i(r), 7 G SL 2 (Z), t EH, 

and in fact it is also called the modular invariant. The expansion 

(2^) 12 + -.. 1 



i(r) 



(2^) 12 g+--- q 



shows that j has a simple pole at oo (and is normalized to have residue 1 at 
the pole), so it is not quite a modular form. Let /X3 denote the complex cube 
root of unity e 27 ™/ 3 . Easy calculations (Exercise 1.1.8) show that #3(1) = 0 
so that g 2 (i) 7^ 0 and j(i) — 1728, and 52(^3) — 0 so that 93(^3) 0 an d 
jd-13) = 0. One can further show (see [Ros81], [CSar]) that 

g 2 {i) = Ami Wi = 2 u = = 2y/n 

Jo 



VT^¥ v r(3/4) 



and 



9a( «) = (27/16)„|, ^-"jf^j-VfM. 

Here the integrals are elliptic integrals, and -T is Euler's gamma function, to be 
defined in Chapter 4. Finally, Exercise 1.1.9 shows that the j-function surjects 
from W to C. 



Exercises 

1.1.1. Let r be the subgroup of SL 2 (Z) generated by the two matrices 
and Note that [J?] = [J}] n G P for all n G Z. Let a = be a 

matrix in SL 2 (Z). Use the identity 



a 6 




'1 n~ 




'a 6' 


c 




0 1 




c nc + d 



to show that unless c—0, some matrix with 7 G -T has bottom row (c, cf ) 
with |d'| < |c|/2. Use the identity 



a b 




0 


-1" 




6 —a 


c d 




1 


0 




d — c 



8 



1 Modular Forms, Elliptic Curves, and Modular Curves 



to show that this process can be iterated until some matrix with 7 e r 
has bottom row (0, *). Show that in fact the bottom row is (0, ±1), and since 

[1 ~o] 2 = — I ^ can b e taken to be (0, 1). Show that therefore € r and so 
a e r. Thus r is all of SL 2 (Z). 

1.1.2. (a) Show that Im( 7 (T)) = Im(r)/|cr + d\ 2 for all 7 = £ SL 2 (Z). 

(b) Show that (t/)(t) = 7(7'(r)) for all 7,7' e SL 2 (Z) and reH. 

(c) Show that d 7 (r)/dT = 1/(ct + d) 2 for 7 = [ a c b d ] £ SL 2 (Z). 

1.1.3. (a) Show that the set A4fc(SL 2 (Z)) of modular forms of weight k forms 
a vector space over C. 

(b) If / is a modular form of weight k and g is a modular form of weight I, 
show that fg is a modular form of weight k + I. 

(c) Show that <Sfc(SL 2 (Z)) is a vector subspace of A^fe(SL 2 (Z)) and that 
<S(SL 2 (Z)) is an ideal in M(SL 2 (Z)). 

1.1.4. Let k > 3 be an integer and let V = 7? - {(0, 0)}. 

(a) Show that the series d) e ^/(sup{|c|, |c?|}) _fc converges by considering 
the partial sums over expanding squares. 

(b) Fix positive numbers A and B and let 

Q = {t g H : |Re(r)| < A,Im(r) > B}. 

Prove that there is a constant C > 0 such that |r + <5| > C sup{l, \ S\} for all 
t e i7 and 5 e R. (Hints for this exercise are at the end of the book.) 

(c) Use parts (a) and (b) to prove that the series defining Gk (r) converges 
absolutely and uniformly for r € (2. Conclude that Gk is holomorphic on %. 

(d) Show that for 7 £ SL 2 (Z), right multiplication by 7 defines a bijection 
from L' to L' . 

(e) Use the calculation from (c) to show that Gk is bounded on fi. From the 
text and part (d), Gk is weakly modular so in particular Gk(r + 1) = Gfc(r). 
Show that therefore Gfc(r) is bounded as Im(r) — > 00. 

1.1.5. Establish the two formulas for 7rcot ttt in (1.1). (A hint for this exercise 
is at the end of the book.) 

1.1.6. This exercise obtains formula (1.2) without using the cotangent. Let 
/(t) = X)dez l/( r + d) h for k > 2 and t £ H. Since / is holomorphic (by 
the method of Exercise 1.1.4) and Z-periodic and since limim^)-^ /(t) = 0, 
there is a Fourier expansion /(r) = J2m=i tt m9 m — 9(l) as m * ne section, 
where g = e 27r4T and 

am= 2^lg^ dq 

is a path integral once counterclockwise over a circle about 0 in the punctured 
disk D' . 

(a) Show that 
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rl+iy r+oo+iy 

a m = f{T)e~ 2nmT dT = I T - k e- 2mmT dT for any y > 0. 

Jr—O+iy J r— — oo+iy 



(b) Let g m {r) = t k e 27 ™ mr ; a meromorphic function on C with its only 
singularity at the origin. Show that 

-27riRes r=o5m (r) = )- ^-m fe_1 . 

(fe-1)! 

(c) Establish (1.2) by integrating <? m (r) clockwise about a large rectangular 
path and applying the Residue Theorem. Argue that the integral along the 
top side goes to a m and the integrals along the other three sides go to 0. 

(d) Let h : R — > C be a function such that the integral \h(x)\dx 
is finite and the sum Xdez M x + d) converges absolutely and uniformly on 
compact subsets and is infinitely differentiablc. Then the Poisson summation 
formula says that 



deZ mEZ 



h(x + d) = V h{m)e 2nimx 



where h is the Fourier transform of h, 



POO 

h(x) = / h(t) 

J t— — oo 



, e~ zmxt dt. 

t— — oo 

We will not prove this, but the idea is that the left side sum symmetrizes h to 
a function of period 1 and the right side sum is the Fourier series of the left 
side since the mth Fourier coefficient is J X^ez + d)e~ 27Tlmt dt = h(m). 
Letting h(x) = l/r k where r = x + iy with y > 0, show that h meets the 
conditions for Poisson summation. Show that h(m) = e~ 27rmy a m with a m 
from above for m > 0, and that h(m) = 0 for m < 0. Establish formula (1.2) 
again, this time as a special case of Poisson summation. We will see more 
Poisson summation and Fourier analysis in connection with Eisenstein series 
in Chapter 4. (A hint for this exercise is at the end of the book.) 

1.1.7. The Bernoulli numbers Bk are defined by the formal power series ex- 
pansion 

t ^ ffc 



fe=0 

Thus they are calculable in succession by matching coefficients in the power 
series identity 

k=0 n=l \fe=0 v 7 / 



(i.e., the nth parenthesized sum is 1 if n = 1 and 0 otherwise) and they are 
rational. Since the expression 



10 



1 Modular Forms, Elliptic Curves, and Modular Curves 



t t _ t e* + 1 

e* - 1 + 2 ~ 2 ' e* - 1 

is even, it follows that B\ = —1/2 and — 0 for all other odd fc. The 
Bernoulli numbers will be motivated, discussed, and generalized in Chapter 4. 

(a) Show that B 2 = 1/6, B 4 = -1/30, and B 6 = 1/42. 

(b) Use the expressions for ir cot ttt from the section to show 



1 - 2 ^ ((2k)T 2k = TTT COt TTT = lliT + ^ B k 



{2mrf 



k\ ' 

fe=l k=0 

Use these to show that for k > 2 even, the Riemann zeta function satisfies 

2C(*) = -^* fc) 

so in particular C(2) = tt 2 /6, C(4) = tt 4 /90, and £(6) = tt 6 /945. Also, this 
shows that the normalized Eisenstein series of weight k 



£fc(T)= 2CW =1 -^g afc - l( ^ 



Gfc(r) = 2fc 
2C(fc) Bfe 

has rational coefficients with a common denominator. 

(c) Equate coefficients in the relation E 8 (t) — E±(t) 2 to establish for- 
mula (1.3). 

(d) Show that ao = 0 and a\ = (27r) 12 in the Fourier expansion of the 
discriminant function A from the text. 

1.1.8. Recall that ^3 denotes the complex cube root of unity e 27 ™/ 3 . Show that 
[1 ~o] (M3) = P3 + 1 so that by periodicity g 2 {[\ (^3)) = 92^3) ■ Show 
that by modularity also <72([i _ o] (a*3)) = ^332(^3) and therefore #2(^3) = 0. 
Conclude that #3(^3) ^ 0 and — 0. Argue similarly to show that 53 (i) = 
0, ^ 0, and = 1728. 

1.1.9. This exercise shows that the modular invariant j : % — > C is a sur- 
jection. Suppose that c £ C and j(r) 7^ c for all t £%. Consider the integral 

1 f j'( T )dr 



2iri J 1 j(r) - c 

where 7 is the contour shown in Figure 1.1 containing an arc of the unit 
circle from (—1 + i\/3)/2 to (1 + i-\/2>)/2, two vertical segments up to any 
height greater than 1, and a horizontal segment. By the Argument Principle 
the integral is 0. Use the fact that j is invariant under [q\] to show that 
the integrals over the two vertical segments cancel. Use the fact that j is 
invariant under [ J _ J ] to show that the integrals over the two halves of the 
circular arc cancel. For the integral over the remaining piece of 7 make the 
change of coordinates q — e 2T " T , remembering that j'(r) denotes derivative 
with respect to t and that j(r) = l/q + ■ ■ ■ , and compute that it equals 1. 
This contradiction shows that j(r) = c for some r £ % and j surjects. 
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Figure 1.1. A contour 



1.2 Congruence subgroups 

Section 1.1 stated that if a meromorphic function / : H — > C satisfies 



fh(T)) = (cr + d) k f(T) for 7 
then / is weakly modular, i.e, 

fh(r)) = (cT + d) k f(T) for all 7 



1 1 
0 1 



and 7 



0 -1 

1 0 



a b 
c d 



€ SL 2 (Z). 



Replacing the modular group SL 2 (Z) in this last condition by a subgroup r 
generalizes the notion of weak modularity allowing more examples of weakly 
modular functions. 

For example, a subgroup arises from the four squares problem in number 
theory, to find the number of ways (if any) that a given nonnegative integer n 
can be expressed as the sum of four integer squares. To address this, define 
more generally for nonnegative integers n and k the representation number 
of n by k squares, 

r(n, k) = #{w e Z fe : n = v\ H V v 2 k }. 

Note that if i + j = k then r(n,k) = ^2 l+m=n r(l,i)r(m, j), summing over 
nonnegative values of I and m that add to n (Exercise 1.2.1). This looks like 
the rule c„ = X)z+m=n a i^m relating the coefficients in the formal product of 
two power series, 




,.q 



m=0 



E 

n=0 



c„q 



So consider the generating function of the representation numbers, meaning 
the power series with nth coefficient r(n, k), 

oo 
n=0 
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Chapter 4 will show that the series is absolutely convergent, and so the formal 
product relation between power series applies to the 0 series, 

6(T,k 1 )6(T,k 2 ) = 0(T,k 1 + k 2 ), teU. 

Clearly 0 satisfies the transformation formula 

0(r + l,fc) = 9(T,k). 

To obtain another such law for 9, write 9{t) for 9(t, 1) and note that the 
definition of the representation numbers combines with absolute convergence 
to give the rearrangement 



9(t) = J2e 2md T - 
dez 

This looks like the left side of the Poisson summation formula from Exer- 
cise 1.1.6(d) with x — 0 and h(d) = e 27Tld T . Chapter 4 will carry out the 
summation, obtaining the transformation law 

0(-1/(4t)) - V^t9{t). 

(This formula calls for the principal branch of square root, making sense since 
— 2ir lies in the right half plane.) The matrix [° ~q 

does not have determinant 1, but the product _° 1 ^ 4 
taking t to t/(4t+ 1), does. Applying the corresponding succession of trans- 
formations and the transformation laws for 9 gives 



taking r to — l/(4r) 
[o i] [4 0] 



1 01 
4 iJ> 




2/ I - + 1 ) (-2ir) 9{t) 



= V4r + 16»(t). 



Now the product relation 9(t, 4) = 9(t) 4 gives a second transformation for- 
mula for the four squares problem, 



4 = (4t + 1) 2 0(t,4). 



v 4t + :T 

That is, analogously to the first formula of this section, 



0(7(1-), 4) = (ct + (1) 2 9(t, 4) for 7 = ± 



1 1 
0 1 



and 7 = ± 



1 0 
4 1 



The subgroup of SL2(Z) associated to the four squares problem is the group 
generated by ± [ J \ ] and ± [ \ J ] . We will soon see that this is a special instance 
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of the subgroups about to be introduced more generally. The four squares 
problem will be solved at the end of the section once more ideas are in place. 



Let N be a positive integer. The principal congruence subgroup of level N 



is 



r(N) 



a b 
c d 



£ SL 2 (Z) : 



a b 
c d 



1 0 
0 1 



(mod N) 



Y 



(The matrix congruence is interpreted entrywise, i.e., a = 1 (mod N) and so 
on.) In particular -T(l) = SLi2(Z). Being the kernel of the natural homomor- 
phism SL 2 (Z) — ► SL 2 (Z/iVZ), the subgroup r(N) is normal in SL 2 (Z). In 
fact the map is a surjection (Exercise 1.2.2(b)), inducing an isomorphism 

SL 2 (Z)/r(N) -A- SL 2 (Z/7VZ). 

This shows that [SL 2 (Z) : r(N)] is finite for all N. Specifically, the index is 



[SL 2 (Z) :r(JV)] = JV 3 JJ (1 



p\N 

where the product is taken over all prime divisors of N (Exercise 1.2.3(b)). 

Definition 1.2.1. A subgroup r of SL 2 (Z) is a congruence subgroup if 

r(N) C r for some N £ Z + , in which case r is a congruence subgroup of 
level N. 

Thus every congruence subgroup r has finite index in SL 2 (Z). Besides the 
principal congruence subgroups, the most important congruence subgroups 
are 



r 0 (N) = 



a b 
c d 



£ SL 2 (Z) 



a b 
c d 



* * 
0 * 



(mod N) 



(where "*" means "unspecified") and 



A (AO 



satisfying 
The map 



a b 

c d 



£ SL 2 (Z) 



a b 

c d 



1 * 
0 1 



(mod N) 



r{N) c a(ao c r 0 (N) c sl 2 (Z). 



A (AT) — > Z/NZ, 



a b 
c d 



i y b (mod N) 



is a surjection with kernel r(N) (Exercise 1.2.3(c)). Therefore r(N) < ^(N) 
and 

ri(Ar)/r(AT) Z/NZ, [Ji (N) : r(N)] = N. 
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Similarly the map 



r 0 (N) — > (Z/NZ)*, 



a b 

c d 



i y d (mod N) 



is a surjection with kernel A (AT) (Exercise 1.2.3(d)), so that A (AT) < A (AO 
and 

^oW/AW (Z/NZ)*, [r 0 (N) : A (TV)] = <f>(N), 

where <j> is the Euler totient function from number theory. (Recall that <j)(N) 
counts the elements of {0, . . . , AT — 1} relatively prime to N, so that (f>(l) = 1 
and (j)(N) = \(Z/NZ)*\ for N > 1; see, for example, [IR92] Chapter 2.) It 
follows that [SL 2 (Z) : r 0 (N)} = N]] plN (l + l/p), the product taken over all 
primes dividing N (Exercise 1.2.3(e)). 

Returning briefly to the example of the four squares problem, its associ- 
ated group A generated by the matrices ± [ J \ ] and ± [ \ \ ] is A (4) (Exer- 
cise 1.2.4). 

Two pieces of notation are essential before we continue. For any matrix 
7 = [°d] e SL 2 (Z) define the factor of automorphy j(7> r) e C for t e H to 
be 

j( 7 ,r) = cr + rf, 

and for 7 e SL 2 (Z) and any integer k define the weight-k operator [y]k on 
functions / : H — > C by 

(/[7] fc )(r)=J-(7,r)- fe /(7(r)), r eU. 

Since the factor of automorphy is never zero or infinity, if / is meromorphic 
then f[j]k is also meromorphic and has the same zeros and poles as /. Now we 
can define weakly modular of weight k with respect to r to mean meromorphic 
and weight-fc invariant under A that is, a meromorphic function / on % is 
weakly modular of weight k if 

fUk = f for all 7 er. 

If / is weakly modular of weight k with respect to _T then its zeros and poles 
are /^-invariant as sets. The factor of automorphy and the weight-A: operator 
will be ubiquitous from now on. The next lemma lists some of their basic 
properties. 

Lemma 1.2.2. For all 7,7' e SL 2 (Z) and r eU, 

(a) j(n',r) = j(a,i(r))j(i,T), 

(b) (77 / )W=7(7'(t)), 

(c) [77'] k = [l]k[l']k {this is an equality of operators), 
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(e) 



dr 



i(7,T) 2 



Proof. Some of these have already been established in Exercise 1.1.2 by com- 
puting with the matrix entries, and the rest can be shown similarly. But the 
results are now phrased intrinsically, i.e., with no direct reference to the ma- 
trix entries, and so a proof in the right spirit should be intrinsic as well. Each 
element 7 of SL 2 (Z) acts on column vectors via multiplication and acts on 
points as a fractional linear transformation, and computing with the matrix 
entries only once shows that the relation between these actions is 



7 



T 




7(r) 


1 




1 



Applying this identity repeatedly gives 
77'-[;] = [ (77 ; )(T) ]i(77',r) : 
7-7 i =7 3K1 ,t) = 



7( 7 '(r)) 
1 



,t). 



The left sides are equal, hence so are the right sides. Equating the lower entries 
of the right sides proves (a), and then equating the upper entries proves (b). 
Next, for any / : H — > C compute that 

(/[77'] fc )(r)=j(77',r)- fc /((77')(T)), 
((/[7] fc )[7'] fc )(r)-. ? (7',r)- fc (/[7] fe )(7'(r)) 

= .7(7',r)- fe 7 (7,7'(T))- fc /(7(7'(r))). 

The right sides are equal by parts (a) and (b), hence so are the left sides, 
proving (c). For parts (d) and (e), juxtapose two copies of the relation between 
the actions of 7 side by side to get 



7 



Taking determinants and letting r' — > r gives (e). Setting t' — T (complex 
conjugate), noting 7(7=) = 7(7-) and ,7(7, r) = j(-f, r), and taking determinants 
gives (d). □ 

One consequence of the lemma is that if a function / : H — > C is weakly 
modular of weight k with respect to some set of matrices then / is weakly 
modular of weight k with respect to the group of matrices the set generates. 
Thus the fact that the matrices [q\\ and [10] g enera te SL 2 (Z) justifies 
the claim that weak modularity needs to be checked only for these matrices 
in order that it hold for the full group. Similarly, the function 9(t, 4) from 



"tt'~ 




7(7-) 7(7-') 




3(7, t) 0 


1 1 




1 1 




L 0 i( 7 ,r 
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the four squares problem is weakly modular of weight 2 with respect to the 
group r g = r 0 (4) generated by ± [ J {] and ± [ \ $>]. 

We now develop the definition of a modular form with respect to a con- 
gruence subgroup. Let k be an integer and let r be a congruence subgroup of 
SL 2 (Z). A function / : % — > C is a modular form of weight k with respect 
to r if it is weakly modular of weight k with respect to r and satisfies a 
holomorphy condition to be described below. When —I^T, nonzero modu- 
lar forms of odd weight may well exist, in contrast to the case of SL 2 (Z) in 
Section 1.1. 

Each congruence subgroup r of SL 2 (Z) contains a translation matrix of 
the form 



for some minimal h e Z+. This is because r contains r(N) for some N, but 
h may properly divide N — for example, the group A (AO contains the trans- 
lation matrix [J }]. Every function / : H — > C that is weakly modular with 
respect to r therefore is /iZ-periodic and thus has a corresponding function 
g : D' — > C where again D' is the punctured disk but now /(r) = g{qh) 
where qn = e 2 ^ lT ^ h . As before, if / is also holomorphic on the upper half 
plane then g is holomorphic on the punctured disk and so it has a Laurent 
expansion. Define such / to be holomorphic at oo if g extends holomorphically 
to q = 0. Thus / has a Fourier expansion 

oo 

/(r) = ^>„C q h = e 2 ™ ' h . 

71=0 

To keep the vector spaces of modular forms finite-dimensional, modular 
forms need to be holomorphic not only on H but at limit points. For a con- 
gruence subgroup r the idea is to adjoin not only oo but also the rational 
numbers Q to H, and then to identify adjoined points under Inequivalence. A 
-T-equivalence class of points in Q U {oo} is called a cusp of T. The geometry 
motivating the term "cusp" will be explained in Chapter 2. When r = SL 2 (Z) 
all rational numbers are /^-equivalent to oo and so SL 2 (Z) has only one cusp, 
represented by oo. But when r is a proper subgroup of SL 2 (Z) fewer points 
are .T-equivalent and so r will have other cusps as well, represented by ratio- 
nal numbers. Since each seQ takes the form s = a(oo) for some a € SL 2 (Z), 
the number of cusps is at most the number of cosets Fa in SL 2 (Z) but possi- 
bly fewer, a finite number since the index [SL 2 (Z) : r] is finite. We will count 
the cusps in Chapter 3. 

A modular form with respect to a congruence subgroup r should be holo- 
morphic at the cusps. Writing any s £ QU {oo} as s = a(oo), holomorphy 
at s is naturally defined in terms of holomorphy at oo via the [a]k opera- 
tor. Since f[a]k is holomorphic on "H and weakly modular with respect to 
a~ 1 ra, again a congruence subgroup of SL 2 (Z) (Exercise 1.2.5), the notion 
of its holomorphy at oo makes sense. 
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Definition 1.2.3. Let r be a congruence subgroup o/SL 2 (Z) and let k be an 
integer. A function f : H — > C is a modular form of weight k with 
respect to r if 

(1) / is holomorphic, 

(2) / is weight-k invariant under r, 

(3) f[a]k is holomorphic at oo for all a £ SL 2 (Z). 

If in addition, 

(4) ao = 0 in the Fourier expansion of f[a]k for all a £ SL 2 (Z), 

then f is a cusp form of weight k with respect to r . The modular forms of 
weight k with respect to r are denoted M. k {r), the cusp forms Su{r). 

The cusp conditions (3) and (4) are phrased independently of the con- 
gruence subgroup r. But in fact f[a]k only needs to be checked in (3) 
and (4) for the finitely many coset representatives aj in any decomposition 
SL 2 (Z) = [J. raj since f[yaj]k — f[oij]k for all 7 e T by the second con- 
dition, and these representatives depend on r. In connection with the third 
condition, we will need the following result (Exercise 1.2.6) in Chapter 4. 

Proposition 1.2.4. Let r be a congruence subgroup o/SL 2 (Z) of level N, 
and let q^q — e 2 ~ KlT / N for r € %. Suppose that the function f : % — > C 
satisfies conditions (1) and (2) in Definition 1.2.3 and satisfies 

(3') In the Fourier expansion /(r) = X^^Lo a «9Ar> ^ e coefficients for n > 0 
satisfy the condition 

|a„j < C'n r for some positive constants C and r. 

Then f also satisfies condition (3) of Definition 1.2.3, and so f £ Mk(r). 

For instance, the proposition easily shows that 9{t, 4) £ A / l 2 (/o(4)). Con- 
dition (3) also implies condition (3'), as we will finally see in Section 5.9. 

Definition 1.2.3 reduces to previous ones when r = SL 2 (Z). As before, the 
modular forms and the cusp forms are vector spaces and subspaces, and the 
sums 

M(r) = Q)M k (r) and 5(r) = 05 fc (r) 

feez fcez 

form a graded ring and a graded ideal. 

The third condition of Definition 1.2.3 does not associate a unique Fourier 
series expansion of / to a rational number s or even to s = 00. If the matrix 
a £ SL 2 (Z) takes 00 to s then so does any matrix ±a(3 where (3 = [${] with 
j £ Z. Note that (/[±a/3] fe )(r) = (±l) fc (/[a] fe )(r + j) (Exercise 1.2.7). Let h' 
be the smallest positive integer such that [J ^ ] € aT^Fct. (Since r(N) C r 
and r(N) is normal in SL 2 (Z), it follows that r(N) C oT x Fa and so h! \ N.) 
If 
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oo 

(/H*)W = E°»?V. q h '=e 2 ™' h ' 

then since e 2m ( T +i)/ h ' = ^ h ,q h , where /i^/ = e 2m / h ' is the complex h'th root 
of unity, it follows that 



(f[±a0\ k )(T) = (±l) fe J2 - e 2 ^^' 

n=0 



and all such expansions are equally plausible Fourier series of / at s. In par- 
ticular, when k is odd the leading coefficient is only determined up to sign, 
and thinking of f(s) as a 0 does not give it a well defined value. What is well 
defined is whether ao is 0, and so the intuition that a cusp form vanishes at 
all the cusps makes sense. 

For more examples of modular forms with respect to congruence sub- 
groups, start from the weight 2 Eisenstein series 



cezdez' v ' 



where Z' c = Z — {0} if c = 0 and Z' c = Z otherwise. This series converges only 
conditionally, but the terms are arranged so that in specializing equation (1.2) 
to k = 2, the ensuing calculation remains valid to give 

oo 

G 2 (t) = 2C(2) - 8tt 2 J2 1 = ^\ = E d 

n=l d|n 

d>a 

(Exercise 1.2.8(a)). Conditional convergence keeps G 2 from being weakly mod- 
ular. Instead, a calculation that should leave the reader deeply appreciative 
of absolute convergence in the future shows that 



27T7 C 

(G 2 [ 7 ] 2 )(t)=G 2 (t)-— - for 7 = 
ct + a 



a b 

c d 



€ SL 2 (Z) (1.4) 



(Exercise 1.2.8(b-c)). The corrected function G 2 (t) — 7r/Im(r) is weight-2 in- 
variant under SL 2 (Z) (Exercise 1.2.8(d)), but it is not holomorphic. However, 
for any positive integer N, if 

G 2 , n (t) = G 2 (t) - NG 2 (Nt) 

then G 2 ,A r € .M 2 (rb(iV)) (Exercise 1.2.8(e)). We will see many more Eisen- 
stein series in Chapter 4. 

Weight 2 Eisenstein series solve the four squares problem from the be- 
ginning of the section. The modular forms G 2j2 and G 2 ,4 work out to (Exer- 
cise 1.2.9) 
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G 2 ,M = -y(l + 24£( E d y) 

and 



n=l 0<d|n 
(i odd 



G 2)4 (r) = -7r 2 (l + 8^( £ d)« n )- 

1=1 0<d|n 

Now, G 2 ,2 € .M 2 (-T 0 (2)) C A^2(^o(4)) (the smaller group allows more wcight- 
2 invariant functions and the other conditions in Definition 1.2.3 make no refer- 
ence to a congruence subgroup, so the smaller group has more modular forms) 
and G 2 , 4 G M 2 (r 0 (4)). Exercise 3.9.3 will show that dim(Af 2 (r 0 (4))) = 2, so 
G 2j2 and G 2j4 , which visibly are linearly independent, are a basis. Recall that 
the function 9(t, 4) also lies in the space A^ 2 (/o(4)). Thus 0 = aG 2 . 2 + bG 2y 4 
for some a, b € C, and the expansions 

0(t,4) = l + 8« + -- - , 

-\g 2 . 2 {t) = l + 24q +••• , 

-4 G 2. 4 (r) = l + 8g + -- , 

show that 0(t, 4) = — (l/-7r 2 )G 2i 4(r). Equating the Fourier coefficients gives 
the representation number of n as a sum of four squares, 



-(n,4) = 8 ^ d, n > 1. 



0<<i|n 

In particular, if 4 { n then r(n,4) = 8cri(n). The two squares problem, the 
six squares problem, and the eight squares problem are solved similarly once 
additional machinery is in place. For any even s > 10 the same methods 
give an asymptotic solution f(n, s) to the s squares problem, meaning that 
limbec r(n, s)/r(n, s) = 1. Exercise 4.8.7 will discuss all of this. 

For another application of weight 2 Eisenstein series, first normalize G 2 to 



E2{t) = 2^y = 1 - 24 E^ n - 



n=l 



Then equation (1.4) with 7 = [° J] specializes to 

t-*E 2 (-1/t) = E 2 (t) + ^-. (1.5) 

The Dedekind eta function is the infinite product 

00 

V(r) = q 2 4 n (! 9"), 924 = e 27rW24 , 9 = e 2 ™ . 



n=l 
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Since the series S(t) — Yl^Li l°g(l — Q n ) converges absolutely and uniformly 
on compact subsets of T-L (Exercise 1.2.10(a)), a theorem from complex anal- 
ysis (see for example [JS87]) says that r\ is holomorphic on W and satisfies 
the logarithmic differentiation formula for products, (logjln fn)' — J2 n fn/fn 
(Exercise 1.2.10(b)). The eta function satisfies a relation similar to that sat- 
isfied by the theta function. 

Proposition 1.2.5. The Dedekind eta function satisfies the transformation 
law 

t](—1/t) = \J — iti](t), t € H. 



Proof. Compute the logarithmic derivative 

j ■ 00 j d ■ oo oo 

d=l q d=l 771=1 



dr 



It follows that 



oo oo . oo / 

m=l d=l 71=1 \0<d\n 



ilog(,(-l/T)) = gr- 2 E 2 (-l/T) 



and 



i / / — — 1 7ri _ , . 7ri / 12 \ 

log(V=i^(r)) = ^ + ^(r) = - ^ 2 (r) + — j . 



These are equal by (1.5), so the desired relation holds up to a multiplicative 
constant. Setting r = i shows that the constant is 1. □ 

The function 

oo 

V 2 \r) = ql[(l- q n r, q = e 



27TIT 



is invariant under r i->- t + 1 and satisfies r/ 24 (— 1/r) = t 12 w 24: (t). Also, 
limi m ( T )_ > . 00 ?7 24 (t) — 0, so ?7 24 e 5i2(SL 2 (Z)). We will see that this space, 
which also contains the discriminant function A from Section 1.1, is 1-dimcn- 
sional. Comparing the leading terms of the Fourier expansions »7 24 (t) = q+- ■ ■ 
and A(t) = (2n) 12 q + ■ ■ ■ thus gives the remarkable identity 

A={2*f 2 r ] 2 \ 

or 

(c,d) (c,d) ra=l 
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More on the eta function, especially its connection with theta functions per 
the beginning of this section, is in Chapter 1 of [Bum97]. 

For a more general example, to make more modular forms with respect 
to congruence subgroups, start from any modular form /. Let d be a positive 
integer and let g(r) = f(dr). Then g is a modular form with respect to r 0 (d). 
The same process takes modular forms with respect to Jo (AT) to modular 
forms with respect to r 0 (dN) and takes modular forms with respect to A(-W) 
to modular forms with respect to Ji(dAT) (Exercise 1.2.11). 

Exercises 

1.2.1. Show that r(n, k) = J2i+ m =n i)r(m, j) when i + j = k. 

1.2.2. Let iV be a positive integer. Let 7 € SL 2 (Z/7VZ) be given. Lift 7 to a 
matrix [»J]eM 2 (Z). 

(a) Show that gcd(c, d, N) = 1. Show that gcd(c',d') = 1 for some c' = 
c + sN and d' = d + tN where s,<eZ. (A hint for this exercise is at the end 
of the book.) 

(b) Show that some lift [ a+ * N b+ j N ] of 7 lies in SL 2 (Z). Thus the map 
SL 2 (Z) — ► SL 2 (Z/NZ) surjecte. 

1.2.3. (a) Let p be a prime and let e be a positive integer. Show that 
|SL 2 (Z/p e Z)| = p 3e (l - l/p 2 ). (Hints for this exercise are at the end of the 
book.) 

(b) Show that |SL 2 (Z/A^Z)| = N 3 U p \n( 1 - 1 /p 2 ), so thi s is also the index 
[SL 2 (Z) : r(N)\. 

(c) Show that the map A (AT) — ► Z/NZ given by [ a c b d ] ^ b (mod N) 
surjects and has kernel r(N). 

(d) Show that the map T 0 (A^) — > (Z/NZ)* given by [ a c b d ] ^ d (mod N) 
surjects and has kernel ri(N). 

(e) Show that [SLa(Z) : r 0 (N)] = JVn p |jv(l + Vp)- 

1.2.4. Let r$ be the group generated by the matrices ± [ 0 \ ] and ± [ \ J]. This 
exercise shows that Je = J~b(4). The containment "C" is clear. For the other 
containment, let a = [" d ] be a matrix in Jo (4). Similarly to Exercise 1.1.1, 
the identity 



a b 




'1 n 




'a b' 


c d 




0 1 




c nc+ d 



shows that unless c = 0, some matrix 0:7 with 7 G Jg has bottom row (c', d') 
with \d'\ < \c'\/2. (The inequality is strict because d = 0 (mod 4) and <f is 
odd.) Use the identity 



a b 




' 1 0" 




a' 6" 


c d 




An 1 




c + 4nd d 
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to show that unless d = 0, some matrix cry with 7 g fj has bottom row 
(c',d') with |c'| < 2|<f |. (The inequality is strict because of properties of c' 
and d' modulo 4.) Each multiplication reduces the positive integer quantity 
min{|c|, 2|rf|}, so the process must stop with c — 0 or d = 0. Show that in fact 
this means cry £ I0 for some j £ Tg and so a e Jg. 

1.2.5. If r is a congruence subgroup of SL 2 (Z) and 7 £ SL 2 (Z), show that 
7~ 1 / n 7 is again a congruence subgroup of SL 2 (Z). 

1.2.6. This exercise proves Proposition 1.2.4. Let r be a congruence subgroup 
of SL 2 (Z), thus containing r(N) for some N, and suppose that the function 
/ : W — > C is holomorphic and weight-fc invariant under r. Suppose also 
that in the Fourier expansion /(r) = X^^Ln 0 ™^' the coefficients for n > 0 
satisfy |a„| < Cn r for some positive constants C and r. 

(a) Show that for any r = x + iy e "H, 

00 

|/(r)|<|a 0 | + C^n'' e - 2 ™^ v . (1.6) 

ra=l 

Changing n to a nonncgative real variable t, show that the continuous version 
g(t) = t r e - 2 nt y /N Q £ summanc i increases monotonically on the interval 
[0, and then decreases monotonically on [f^,oo)- Using this, represent 
all but two terms of the sum in (1.6) as unit-wide boxes under the graph of g 
and consider the missing terms individually to establish the estimate (where 
in this exercise C 0 and C can denote different constants in different places) 

\f(T)\<C 0 + c(J^g(t)dt+l/y r ). 

After a change of variable the integral takes the form C/y r+1 ■ J^ Q t r e^ t dt. 
This last integral is a gamma function integral, to be introduced formally in 
Section 4.4, but at any rate it converges at both ends and is independent of y. 
In sum, 

|/(r)| <C 0 + C/y r asy^^. 

(b) For every a £ SL 2 (Z), the transformed function (/[a]fc)(r) is holo- 
morphic and weight-fc invariant under a^ l Ta and therefore has a Laurent 
expansion 

(/M*)(r) = 5>n<fc' q N = e^/ N . 

n£Z 

To show that the Laurent scries truncates from the left to a power series it 
suffices to show that 

lim ((f[a] k )(r)-q N )=0. 

ijjv-s-0 

If a fixes 00 then this is immediate from the Fourier series of / itself. Otherwise 
show that the transformed function (f[a]k)(r) = (cr + d)~ k /(a(r)) satisfies 
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lim \(f[a] k )(r) ■ q N \ <C lim (y r - k \q N \). 

Recalling that <? w = e 2ni ^ x+iy ^ N , show that y = Clog(l/|gjv|), and use the 
fact that polynomials dominate logarithms to complete the proof. (A hint for 
this exercise is at the end of the book.) 

1.2.7. Show that (/[±a/3] fe )(r) = (±l) fe (/[a] fc )(r + j) in the comment after 
Proposition 1.2.4. 

1.2.8. (a) Verify the Fourier expansion of G 2 . 

(b) Suppose that G 2 satisfies the identity in (1.4) for two particular matri- 
ces 71,72 € SL 2 (Z). Show that G 2 then satisfies the identity for the product 
7172 and the inverse 7f 1 as well. Thus to establish (1.4) it suffices to prove 
the identity for a set of generators for SL 2 (Z). (Hints for this exercise are at 
the end of the book.) 

(c) Recall that SL 2 (Z) is generated by [ \ \ ] and [ " ~ J ] . The Fourier expan- 
sion for G 2 shows that it satisfies (1.4) for 7 = [J}]. For the other generator, 
show that 

(G a [[? "S]] a )(r) = t- 2 G 2 (-1/t) = J2 E 

deZc<£Z' d y ' 



which differs from G 2 (t) = 2£(2) + E^ CT c ^~ 2 * n ^ e reverse d order of 
summation. Next use partial fractions to show that 

]2I2 z{cT+d){cT+d+1) = 0 - 

Subtract this from G 2 to get 

a(r) = 2<(2)+ S ^ + ^ + * + i) 

where now the sum is absolutely convergent by Exercise 1.1.4 and the limit 
comparison test, and thus it can have its order of summation reversed. Doing 
so and separating the convergence terms back out shows that 

ftW _^ ( . 1/r) .gg__l_. 

The error term is — lim > > f , ) . Reverse the order 

AM-oo /-^ \cT + d CT + d+lJ 

d=-N c^O x 7 

and the inner sum telescopes. Manipulate the result into an expression in- 
cluding a sum for it cot(ir N/t) per the first equality of (1.1). Finally, use the 
other side of (1.1) to take the limit. 
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(d) Show that 

G SL 2 (Z), 



j (7 j T ) 2 I m (7( r )) Im(r) cr + 



27TIC 

tor 7 = 



a b 
c d 



and therefore the function G2(t)— 7r/Im(r) is weight-2 invariant under SL 2 (Z). 

(e) Show that G 2 ,at e M 2 (r 0 (iV)). 

(f) Compute G 2j Ar(i) and G 2j at(/u 3 ) where ^3 = e 27 ™/ 3 . 

1.2.9. Verify the Fourier expansions of G 2j2 and G 2j 4 given in the section. (A 
hint for this exercise is at the end of the book.) 

1.2.10. (a) Show that the series S(t) — Yl^Li l°g(l — Q n ) converges absolutely 
and uniformly on compact subsets of H. (A hint for this exercise is at the end 
of the book.) 

(b) Show the logarithmic differentiation formula (log(J|/n))' = X) fU /« 
for finite products. The analogous result for infinite products applies to the 
Dedekind eta function. 

1.2.11. Let GL^ (Q) be the group of 2-by-2 matrices with positive determi- 
nant and rational entries. The elements of this group act as fractional linear 
transformations on the upper half plane since Im(7(r)) = det 7-Im(r)/|cT+<i| 2 
for 7 G GL^ (Q). Let k be a positive integer. For r € %, extend the formula 
j(7, t) = cr + d to 7 G GL2 (Q), and extend the weight-/c operator to GL^ (Q) 
by the rule 

(/[7]fc)W - (det 7)^^(7, r)- fe /(7(r)) for /:«—►& 

(a) Show that [77'] & = Hfcb'Jfc for all 7,7' G GL^Q), generalizing 
Lemma 1.2.2(b). 

(b) Show that every 7 G GL^Q) satisfies 7 = 07' where a G SL 2 (Z) 
and 7' = r[§J] with r G Q + and a, 6, ci G Z relatively prime. Use this to 
show that given / G Mu{r) for some congruence subgroup I 1 and given such 
7 = 07', since has a Fourier expansion, so does /[7]fc- Show that if 
the Fourier expansion for f[o]k has constant term 0 then so does the Fourier 
expansion for f["f]k- (A hint for this exercise is at the end of the book.) 

(c) Suppose that J\ and -T 2 are congruence subgroups of SL 2 (Z) and sup- 
pose that ri D 7-T 2 7~ 1 for some 7 G GL^ (Q). Prove that if / G Alfc(A) then 
/[7]fe G Aifc(r 2 ). Show that the same result holds for cusp forms. 

(d) Let N and d be positive integers. Suppose / G A / Jfe(Io(iV)). Show that 
if g(r) = f(dr) then g G Mk(ro(dN)), and the same result holds for cusp 
forms. Show that if / G Mk(ri(N)) then g G M k (ri(dN)), and the same 
result holds for cusp forms. 
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This section gives a sketch of results about complex tori, also known as com- 
plex elliptic curves for reasons to be explained in the next section. Since the 
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material is covered in many texts (for example, [JS87] is a very nice second 
complex analysis book) , some facts will be stated without proof. Complex tori 
will be related to modular forms in Section 1.5. 

A lattice in C is a set A = u\L © with {wi, 0J2} a basis for C over R. 
We make the normalizing convention LO1/LO2 € H, but this still does not specify 
a basis given a lattice. Instead, 

Lemma 1.3.1. Consider two lattices A = ui\Z © w 2 Z and A' = ui^Z © w 2 Z 

with u)i/oj 2 € W and co^/lo^ € H. Then A' = A if and only if 







a b 






for some 


a b 






c d 




c d 


j 2 _ 






oj 2 



G SL 2 (Z). 



Proof. Exercise 1.3.1. □ 

A complex torus is a quotient of the complex plane by a lattice, 

C/A={z + A: z e C}. 

Algebraically a complex torus is an Abelian group under the addition it in- 
herits from C. Geometrically a complex torus is a parallelogram spanned by 
{u)\,u)2\ with its sides identified in opposing pairs. Identifying one pair of 
sides rolls the parallelogram into a tube, and then identifying the other pair 
bends the tube into a torus. But the flat model of the complex torus with 
neighborhoods extending across the sides (see Figure 1.2) better illustrates 
that every complex torus is a Riemann surface, roughly meaning a connected 
set that looks like the complex plane C in the small. The precise definition of 
a Riemann surface will be demonstrated by example in Chapter 2. 




Figure 1.2. A complex torus 

The notion of a holomorphic map makes sense for Riemann surfaces since 
it is local. Any holomorphic map between compact Riemann surfaces is cither 
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a surjection or a map to one point. To see this, suppose X and Y are compact 
Ricmann surfaces and / : X — > Y is holomorphic. Since / is continuous 
and X is compact and connected, so is the image f(X), making f(X) closed. 
Unless / is constant / is open by the Open Mapping Theorem of complex 
analysis, applicable to Riemann surfaces since it is a local result, making 
f(X) open as well. So f(X) is either a single point or a connected, open, 
closed subset of the connected set Y, i.e., all of Y. As a special case of this, 
any nonconstant holomorphic map from one complex torus to another is a 
surjection. 

Proposition 1.3.2. Suppose p : C/A — > C/A' is a holomorphic map be- 
tween complex tori. Then there exist complex numbers m, b with raA C A 1 such 
that ip(z + A) = mz + b + A'. The map is invertible if and only if mA = A! . 

Proof. (Sketch.) The key is to lift p to a holomorphic map <p : C — > C 
by using topology. (The plane is the so-called universal covering space of the 
torus — see a topology text such as [MunOO] for the definition and the relevant 
lifting theorem.) With the map lifted, consider for any A e A the function 
fx(z) = p(z + A) — ip(z). Since p lifts a map between the quotients, the 
continuous function f\ maps to the discrete set A' and is therefore constant. 
Differentiating gives <p'(z+X) = <p'(z). Thus ip' is holomorphic and /1-periodic, 
making it bounded and therefore constant by Liouville's Theorem. Now p is 
a first degree polynomial ip(z) = mz + b, and again since this lifts a map 
between quotients, necessarily mA C A'. The original map thus has the form 
asserted in the proposition. If the containment mA C A' is proper then p is 
not injective: some z £ A' satisfies z/m ^ A but p>(z/m + A) = b + A' = <p(A). 
If mA = A' then (l/m)/T = A and the map tp : C/A' — > C/A given by 
ip(w + A') = (w — b)/m + A inverts p. □ 

Corollary 1.3.3. Suppose p : C/A — > C/A' is a holomorphic map between 
complex tori, p(z + A) = mz + b + A' with mA C A'. Then the following are 
equivalent: 

(1) p is a group homomorphism, 

(2) b e A', so p(z + A) = mz + A', 

(3) p(0) = 0. 

In particular, there exists a nonzero holomorphic group homomorphism be- 
tween the complex tori C/A and C/A' if and only if there exists some nonzero 
m G C such that mA C A', and there exists a holomorphic group isomorphism 
between the complex tori C/A and C/A' if and only if there exists some m e C 
such that mA = A' . 

Proof. Exercise 1.3.2. □ 

For one isomorphism of particular interest, start from an arbitrary lattice 
A = wiZ © w 2 Z with ui/u} 2 G U. Let r = u>i/u> 2 and let A T = tZ ® Z. Then 
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since (1/0)2) A = A T , Corollary 1.3.3 shows that the map <p T : C/A — > C/A T 
given by ip(z + A) = z/oj 2 + A T is an isomorphism. This shows that every 
complex torus is isomorphic to a complex torus whose lattice is generated 
by a complex number r£M and by 1. This r is not unique, but if t' £ W 
is another such number then t' = o)'i/o)' 2 where A = wJZ (fi WjZ, and so 
by Lemma 1.3.1 t' = -f(r) for some 7 € SL 2 (Z). Thus each complex torus 
determines a point r e % up to the action of SL 2 (Z). Section 1.5 will show 
as part of a more general argument that in fact the isomorphism classes of 
complex tori biject to the orbits SL2(Z)r in %. 

Definition 1.3.4. A nonzero holomorphic homomorphism between complex 
tori is called an isogeny. 

In particular, every holomorphic isomorphism is an isogeny. Every isogeny 
surjects and has finite kernel the kernel is finite because it is discrete (oth- 
erwise complex analysis shows that the map is zero) and complex tori are 
compact. 

Multiply-by-integer maps are isogenies but not isomorphisms. For any pos- 
itive integer N and lattice A consider the map 

[N]:C/A—*C/A, z + A^Nz + A. 

This is an isogeny since NA C A. Its kernel, the points z + A e C/A such that 
[N](z + A) = 0, is the set of N -torsion points of C/A, a subgroup isomorphic 
to Z/NZ x Z/NZ. (See Figure 1.3.) Letting E denote the torus C/A (for 
reasons to be explained soon), this subgroup is denoted E[N]. 




0 



Figure 1.3. E[5]: the 5-torsion points of a torus 

Cyclic quotient maps are also isogenies but not isomorphisms. Let C/A be 
a complex torus, let N be a positive integer, and let C be a cyclic subgroup 
of E[N] isomorphic to Z/NZ. The elements of C are cosets {c + A} and so as 
a set C forms a supcrlattice of A. Slightly abusing notation we use the same 
symbol for the subgroup and the superlattice. Then the cyclic quotient map 

7T : C/A — -> C/C, z + A^z + C 
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is an isogeny with kernel C. (See Figure 1.4, where A is of the form A T and 
the kernel is C = (1/5)Z + A) 




Figure 1.4. A quotient isogeny with cyclic kernel of order 5 

In fact every isogeny is a composition of the examples already given. To 
see this, consider an arbitrary isogeny 

(p : C/A — >C/A', z + A^mz + A' 

and let K denote its kernel, the finite subgroup K = mT 1 A! / 'A of C/A also 
viewed as the superlattice K = mT 1 A' of A. If TV is the order of if as a 
subgroup then K C E[N] = Z/NZ x Z/NZ, and so by the theory of finite 
Abclian groups K = Z/nZ x Z/nn'Z for some positive integers n and n' . The 
multiply-by- n isogeny [n] of C j A takes if to a cyclic subgroup nK isomorphic 
to Z/n'Z, and then the quotient isogeny it from C/A to C/nK has kernel 
nK. Follow this by the map C/nK — > C/A' given by z + nK \-t (m/n)z + 
(m/n)nK , now viewing nK as a lattice in C. This map makes sense and is 
an isomorphism since (m/n)nK = mK = A'. The composition of the three 
maps is z ~\~ A i ^ nz ~\~ A i ^ nz ~\~ nK mz + A' = ip(z + A). That is, the 
general isogeny is a composition as claimed, 

fp : C/A ^% C/A C/nK C/A'. 

A very similar argument shows that isogeny is an equivalence relation. 
Suppose that ip : C/A — > C/A' is an isogeny. Thus ip(z + A) = mz + A' where 
to 7^ 0 and myl C A'. By the theory of finite Abelian groups there exists a 
basis {dJi,w 2 } of A' and positive integers m, n-i such that {niWi, ^2^2} is a 
basis of to/1. It follows that n\n^A' C to/1 and therefore (nin2/m)A' C A. 
Thus there is a dual isogeny (p : C/A' — > C/A back in the other direction, 
ip(z + A') = (nin 2 /m)z + A. Note that (ip o ip)(z + A) = nin 2 z + A, i.e., the 
isogeny followed by its dual is multiplication by a positive integer. The integer 
n\n 2 in question is the degree of the original isogeny since {wi/m, ^/m} is 
a basis of ker(ip) and {ni^i/m, n 2 <^2/^} is a basis of A, making kcr(ip) = 
Z/n\Z x Z/n 2 Z and showing that ip is riin 2 -to-l. That is, 

ipoip= [deg(»]. 

This condition specifics ip uniquely since ip surjects. Since the map [deg(i^)] 
has degree (deg(</?)) 2 and the degree of a composition is the product of the 
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degrees, the dual isogeny has degree deg(ip) = deg(<p>). The dual isogeny of a 
multiply-by-intcgcr map is itself. The dual isogeny of a cyclic quotient isogeny 
quotients the torus in a second direction by a cyclic group of the same order 
to restore its shape and then expands it back to full size. The dual isogeny 
of an isomorphism is its inverse. The dual of a composition of isogenies is the 
composition of the duals in the reverse order. If ip is an isogeny and <p is its 
dual then the formulas ip(z + A) = mz + A' , (p(z! + A') = (deg(ip) /m)z' + A 
show that also 

<P° <P = [deg(»] = [deg(£)], 

so that ip is in turn the dual isogeny of its dual ip. Isogeny of complex tori, 
rather than isomorphism, will turn out to be the appropriate equivalence 
relation in the context of modular forms. 

If ipi, ipz ■ C/A — > C/A' are isogenies, and <p\ + ip 2 7^ 0 so that their sum 
is again an isogeny, then the dual of the sum is the sum of their duals. To see 
this, let ip : C/A — > C/A' be an isogeny. Thus <p(z + A) = mz + A' where 
m 7^ 0 and mA C A'. Let A = oj{L © w 2 Z with U1/0J2 <= % and similarly 
for A'. Consequently 



moo 1 








= a 




mu)2 




J 2 _ 



for some a 



a b 

c d 



G M 2 (Z). 



Homogenizing this equality gives lo\/u2 — a^^/w^) where now a acts as a 
fractional linear transformation, showing that det a/0 and hence det a > 0 
because in general Im(a(r)) = det a • Im(r)/| j(a, r) | 2 for a e GL 2 (R). This 
justifies the last step of the calculation 

deg(ip) = I ker(p) \ = [mT 1 A' : A] = [A' : mA] = det a. 

Since ip o <p — [deg(<^)] and the matrix of a composition is the right-to-left 
product of the matrices, the dual isogeny must induce the matrix 



a = det a ■ a 



d-b 
-c a 



and conversely this matrix determines the dual isogeny. Now let ipi and p>2 be 
isogenies from C/A to C/A! with <p>\ +<p> 2 7^ 0- Their sum (ipi + (p 2 )(z + A) = 
(mi + m 2 )z + A' gives rise to the matrix 



a-i + a 2 b 1 + b 2 
a + c 2 di + d 2 



and so correspondingly the dual isogeny of the sum is determined by the 
matrix 

d\ + d 2 -bi - b 2 
— ci — c 2 ai + a 2 



&i + &2, 



the sum of the matrices determining the dual isogenies. This proves the claim 
at the beginning of the paragraph, 
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^1+^2 = ^1+^2 if ^1+^2 7^0. (1.7) 

We will cite this fact in Chapter 6. 

For one more example, some complex tori have endomorphisms other than 
the multiply-by- N maps [N], in which case they have complex multiplication. 
Let r = \fd for some squarefree d € Z such that d = 2,3 (mod 4), or 
let t = (—1 + \fd)/2 for squarefree d € Z , d = 1 (mod 4). Then the set 
O = tZ ® Z is a ring. (Readers with background in number theory will 
recognize it as the ring of integers in the imaginary quadratic number field 
Q(Vd).) Let A be any ideal of O and let m be any element of O. Then mA C A, 
so multiplying by m gives an endomorphism of C/A. In particular, the ring 
of endomorphisms of C/Ai is isomorphic to A^ = VL 0 Z rather than to Z, 
and similarly for the ring of endomorphisms of C/A^ where ^3 = e 27 ™/ 3 . 

Let A be a lattice. The TV-torsion subgroup of the additive torus group C/A, 

E[N] = {Pe C/A : [N]P = 0} = fa/N + A) x (wt/N + A), 

is analogous to the TV-torsion subgroup of the multiplicative circle group 
C*/R+ ^ {z e C : \z\ = 1} ^ R/Z, the complex Nth roots of unity 

(i N = {zeC: z N = l}= (e 2m / N ). 

A sort of inner product exists on E[N] with values in [i N , the Weil pairing 

e N : E[N] x E[N] — > /j, N . 

To define this, let P and Q be points in E[N], possibly equal. If A = cjiZ®o;2Z 
with u>i/u>2 G H then 



ujx/N + A 
us 2 /N + A 



for some 7 G M 2 (Z/iVZ) 



since uji/N + A and ^/^V + ^l generate 75 [AT]. The Weil pairing of P and Q is 

This makes sense even though det7 is defined only modulo N. It is inde- 
pendent of how the basis {w 1 ,w 2 } is chosen (and once the basis is chosen 
the matrix 7 is uniquely determined since its entries are reduced modulo N), 
remembering the normalization uj\/uj2 € % (Exercise 1.3.3(a)). If P and Q 
generate E [N] then the matrix 7 lies in the group GL2(Z/7VZ) of invert- 
ible 2-by-2 matrices with entries in Z/7VZ, making det7 invertible modulo iV 
and e^v {P, Q) therefore a primitive complex iVth root of unity. See Exer- 
cise 1.3.3(b-d) for more properties of the Weil pairing, in particular that the 
Weil pairing is preserved under isomorphisms of complex tori. We will use the 
Weil pairing in the Section 1.5. 
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Exercises 

1.3.1. Prove Lemma 1.3.1. 

1.3.2. Prove Corollary 1.3.3. (A hint for this exercise is at the end of the 
book.) 

1.3.3. (a) Show that the Weil pairing is independent of which basis {wi,oj 2 } 
is used, provided u>i/u>2 <= H- 

(b) Show that the Weil pairing is bilinear, alternating, and nondegenerate. 
(Remember that the group fi N is multiplicative.) 

(c) Show that the Weil pairing is compatible with N. This means that for 
positive integers N and d, the diagram 

E[dN] x E[dN] > H dN 

d( v ) 

E[N] x E[N] > Mat 

commutes, where the vertical maps are suitable multiplications by d. (A hint 
for this exercise is at the end of the book.) 

(d) Let A and A' be lattices with mA = A' for some m € C. Show that 
the isomorphism of complex elliptic curves C/A C/A' given by z + A i-> 
mz + A' preserves the Weil pairing. 



1.4 Complex tori as elliptic curves 

This section shows how complex tori C /A can also be viewed as cubic curves of 
the sort mentioned back in the preface. These cubic curves are called elliptic 
despite not being ellipses, due to a connection between them and the arc 
length of an actual ellipse. The presentation here is terse, so the reader may 
want to consult a relevant complex analysis text such as [JS87]. 

The meromorphic functions on a complex torus are what relate it to a 
cubic curve. Given a lattice A, the meromorphic functions f : C/A — > C on 
the torus are naturally identified with the /1-periodic meromorphic functions 
/ : C — ► C on the plane. Exercise 1.4.1 derives some basic properties of these 
functions in general. The most important specific example is the Weierstrass 
p-function 

^) = i + E'((^)2-^ N )' zeC,ztA. 



(The primed summation means to omit uj = 0.) Subtracting 1/oJ 2 from 
l/(z — uj) 2 makes the summand roughly z/uj 3 , cf. the sketched proof of Propo- 
sition 1.4.1 to follow, so the sum converges absolutely and uniformly on com- 
pact subsets of C away from A. Correcting the summand this way prevents 
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the terms of the sum from being permuted when z is translated by a lattice 
element, so p doesn't obviously have periods A. But the derivative 

p'(z) = -2V . 1 

^ (z - w) 3 

clearly does have periods A, and combining this with the fact that p is an even 
function quickly shows that in fact p has periods A as well (Exercise 1.4.2). It 
turns out that p and p' are the only basic examples we need since the field of 
meromorphic functions on C/A is C(p, p'), the rational expressions in these 
two functions. Since the Weierstrass p-function depends on the lattice A as 
well as the variable z we will sometimes write Pa{z) and p' A (z); in particular 
for lattices A T we will write p T (z) and p' T (z). 

Eisenstein series generalize to functions of a variable lattice, 

v^' 1 

G k (A) = 2_^^k> k>2 even > 

so that Gfc(r) from before is now Gk(A T ). As lattice functions Eisenstein 
series satisfy the homogeneity condition Gk(mA) = mr k Gk(A) for all nonzero 
m G C. Part (a) of the next result shows that these lattice Eisenstein series 
appear in the Laurent expansion of the Weierstrass p-function for A. Part (b) 
relates the functions p(z) and p'(z) in a cubic equation whose coefficients are 
also lattice Eisenstein series. 

Proposition 1.4.1. Let p be the Weierstrass function with respect to a lattice 
A. Then 

(a) The Laurent expansion of p is 

PW = ? + E (n+l)G n+2 (A)z n 

n=2 
n even 

for all z such that 0 < \z\ < inf{|w| : u e A - {0}}. 

(b) The functions p and p' satisfy the relation 

(p'(z)) 2 = 4(p(z)) 3 -g 2 (A)p(z)-g 3 (A) 

where g 2 (A) = 60G 4 (/1) and g 3 (A) = U0G 6 (A). 

(c) Let A = luiZ(Blu2 I Z' and let uj 3 = u>i + lJ2- Then the cubic equation satisfied 
by p and p' , y 2 = 4x 3 - g 2 (A)x - g 3 (A), is 

V 2 = 4(z - e 1 )(x - e 2 )(x - e 3 ), e 4 = p(w,/2) for i = 1,2,3. 

This equation is nonsingular, meaning its right side has distinct roots. 
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Proof. (Sketch.) For (a), if \z\ < \cj\ then 

(z-lu) 2 uj 2 uj 2 \(1-z/lo) 2 J 

and the geometric series squares to X)^Lo( n + l) z ™/ w ™> making the summand 
2z/uj 3 + 3z 2 /uj 4 + • • • . Convergence results from Exercise 1.1.4 allow the re- 
sulting double sum to be rearranged, and then the inner sum cancels when n 
is odd. 

For (b), one uses part (a) to show that the nonpositive terms of the Laurent 
series of both sides are equal. Specifically, since 

p{z) = ^ + 3G 4 (A)z 2 + 5G 6 (A)z 4 + 0(z 6 ) 

(where "0" means "a quantity on the order of") and 

p'(z) = ~ + 6G 4 (A)z + 20G 6 (A)z 3 + 0(z 5 ), 

z a 

a little algebra shows that both (p'(z)) 2 and 4(p(z)) i — gi{A)p(z) — ():>,(A) 
work out to 4/z 6 - 2AGi(A)/z 2 - 80G 6 (/1) + <D(z 2 ). So their difference is 
holomorphic and /1-periodic, therefore bounded, therefore constant, therefore 
zero since it is 0(z 2 ) as z — > 0. 

For (c), since p' is odd, it has zeros at the order 2 points of C/A: if 
z = —z (mod A) then p'(z) = p'{—z) = —p'{z) and thus p'(z) = 0. Letting 
A = oj\Z 0 W2Z, the order 2 points are Zi — uJi/2 with p'(zi) = 0 for i = 
1, 2, 3. The relation between p and p' from (b) shows that the corresponding 
values Xi — p{zi) for i = 1,2,3 are roots of the cubic polynomial pa{x) — 
4x 3 — g2(A)x — 53(^1), so it factors as claimed. Each double value of p 

since p'{z{) = 0, and since p has degree 2, meaning it takes each value twice 
counting multiplicity (see Exercise 1.4.1(b)), this makes the three Xi distinct. 
That is, the cubic polynomial pa has distinct roots. □ 

Part (b) of the proposition shows that the map z i-> (pj\{z), p'a{z)) takes 
nonlattice points of C to points (x, y) € C 2 satisfying the nonsingular cubic 
equation of part (c), y 2 = 4x 3 —g2(A)x—g 3 (A). The map bijects since generally 
a value x <G C is taken by pA twice on C/A, that is, x = p^{±z + A), and then 
the two y-values satisfying the cubic equation are p'(±z + A) = ±p'(z + A). 
The exceptional x- values where y = 0 occur at the order-2 points of C/A, 
so they are taken once by pA as necessary. The map extends to all z e C 
by mapping lattice points to a suitably defined point at infinity. In sum, for 
every lattice the associated Weierstrass p-function and its derivative give a 
bijection 

(p, p') : complex torus — > elliptic curve. 

For example, the value gs(i) = 0 from Section 1.1 shows that the complex 
torus C/Ai bijects to the elliptic curve with equation y 2 = 4a; 3 — g2(i)x. 
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Similarly the complex torus C//1 M3 (where again fj, 3 = e 27 ™/ 3 ) bijects to the 
elliptic curve with equation y 2 = 4x 3 — g 3 (fi 3 ). See Exercise 1.4.3 for some 
values of the functions p and p' in connection with these two lattices. 

The map (p, p') transfers the group law from the complex torus to the ellip- 
tic curve. To understand addition on the curve, let z\+A and z 2 +A be nonzero 
points of the torus. The image points (p(zi), p'(zi)) and (p(z 2 ), p'fa)) on the 
curve determine a secant or tangent line of the curve in C 2 , ax + by + c = 0. 
Consider the function 

f(z) = ap(z) +bp'(z) +c. 

This is meromorphic on C/A. When b ^ 0 it has a triple pole at 0 + A and 
zeros at zi + A and Z2 + A, and Exercise 1.4.1(c) shows that its third zero is at 
the point z 3 + A such that z\ + z 2 + 23 + A = 0 + A in C/A. When 6=0,/ has 
a double pole at 0 + A and zeros at Z\ + A and z 2 + A, and Exercise 1.4.1(c) 
shows that zi + z 2 + A = 0 + A in C/A. In this case let z 3 = 0 + A so that again 
z\ + Z2 + z 3 + A = 0 + A, and since the line is vertical view it as containing 
the infinite point (p(0), p'(0)) whose second coordinate arises from a pole of 
higher order than the first. Thus for any value of b the elliptic curve points on 
the line ax + by + c = 0 are the points (x i7 y/) = (p(^), p'(zi)) for i = 1,2, 3. 
Since Z\ + z 2 + z 3 + A = 0 + A on the torus in all cases, the resulting group 
law on the curve is that collinear triples sum to zero. 

Recall that a holomorphic isomorphism of complex tori takes the form 
z ~\~ A 1 ^ raz + A' where A' = mA. Since p m A(mz) = iti~ 2 pa(z) and 
p' mA (mz) — m^ 3 p / A (z), the corresponding isomorphism of elliptic curves is 
(x, y) 1 y (m~ 2 x, m~ 3 y) or equivalently the substitution (x, y) — (m 2 x' , m 3 y'), 
changing the cubic equation y 2 = 4x 3 — g 2 x — g 3 associated to A to the equa- 
tion y 2 = 4x 3 — m~ 4 g 2 x — m~ 6 g 3 associated to A'. Suitable choices of m 
(Exercise 1.4.3 again) normalize the elliptic curves associated to C/mAi and 
C/mA^ 3 to have equations 

y 2 = 4x(x - l)(x + 1), y 2 = 4{x - l)(x - fi 3 )(x - ^ 2 ). 

We will return to these ideas in Chapter 7. 

The appearance of Eiscnstcin series as coefficients of a nonsingular curve 
lets us prove as a corollary to Proposition 1.4.1 that the discriminant function 
from Section 1.1 has no zeros in the upper half plane. 

Corollary 1.4.2. The function A is nonvanishing on H. That is, A(t) ^ 0 
for all t E'H. 

Proof. For any t E H, specialize the lattice A in the proposition to A T . By 
part (c) of the proposition, the cubic polynomial p T {x) = 4x 3 — g 2 {r)x — g 3 {r) 
has distinct roots. Exercise 1.4.4 shows that A(t) is the discriminant of p T up 
to constant multiple (hence its name), so A(t) ^ 0. □ 
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Not only does every complex torus C/A lead via the Weierstrass p- function 
to an elliptic curve 

y 2 = 4a; 3 - a 2 x - o 3 , a% - 27a\^ 0 (1.8) 
with ai = 32 (-4) and 03 = g%{A), but the converse holds as well. 

Proposition 1.4.3. Given an elliptic curve (1.8), there exists a lattice A such 
that a 2 — g2(A) and a 3 = g 3 (A). 

Proof. The case 02 = 0 and the case 03 = 0 are Exercise 1.4.5. For the case 
a 2 7^ 0 and 03 7^ 0, since j : % — > C surjects there exists t £ H such that 
j(r) = 1728a|/(ai - 27a 2 ,). This gives 

52 (t) 3 _ a\ 



52 (t)3-27 33 (t) 2 a|-27a|' 
or, after taking reciprocals and doing a little algebra, 



(1.9) 



52(t) 3 3s(t) 2 ' 

For any nonzero complex number w 2 , let u)\ = tuj 2 and A = ui\Z © w 2 Z. Then 

g 2 (/l) = w^ 4 g 2 ("r) and 33 (yl) = ^^(r). 

Thus we are done if we can choose w 2 such that 

= a 2/g2(r) and u^ 6 = 03/53 (r). 

Choose W2 to satisfy the first condition, so that ti^T 12 = a 2 /g2(i~) 3 . Then 
by (1.9) UJ2 6 — ± a 3/S3( T ) ; an d replacing u> 2 by io>2 if necessary completes the 
proof. □ 

It follows that any map of elliptic curves (x, y) h-> (m~ 2 x, m~ 3 y), changing 
a cubic equation y 2 = 4x 3 — a2X — 03 to the equation y 2 = 4a; 3 — mT^a-ix — 
m _6 a3, comes from the holomorphic isomorphism of complex tori z + A h-> 
mz + /l' where a 2 = 52 (-4), a 3 = 03 C^)> an d -4' = mA. This makes the map of 
elliptic curves an isomorphism as well. 

Thus complex tori (Ricmann surfaces, complex analytic objects) and el- 
liptic curves (solution sets of cubic polynomials, algebraic objects) are inter- 
changeable. With the connection between them in hand, let the term complex 
elliptic curve be a synonym for "complex torus" and call meromorphic func- 
tions with periods A elliptic functions with respect to A. 

The lattice Eisenstein series Gk(A) will appear occasionally from now on 
but generally we will work with Gfc(r). 
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Exercises 

1.4.1. Let E = C/A be a complex elliptic curve where A — U\Z ® w 2 Z, and 
let / be a nonconstant elliptic function with respect to A, viewed either as 
a meromorphic function on C with periods A or as a meromorphic function 
on E. Let P = {xiu>\+X2U>2 : X\,x 2 € [0, 1]} be the parallelogram representing 
E when its opposing boundary edges are suitably identified, and let dP be 
the counterclockwise boundary of P. Since / has only finitely many zeros and 
poles in E, some translation t + dP misses them all. This exercise establishes 
some necessary properties of /. Showing that these properties are sufficient 
for an appropriate / to exist requires more work, sec for example [JS87]. 

(a) Compute that l/(2ni) J t+dP f(z)dz = 0. It follows by the Residue 
Theorem that the sum of the residues of / on E is 0. In particular there is 
no meromorphic function on E with one simple pole and so the Weierstrass 
p-function, with its double pole at A, is the simplest nonconstant elliptic 
function with respect to A. 

(b) Compute that l/(2m) f t+dP f'(z)dz/f(z) = 0. It follows by the Ar- 
gument Principle that / has as many zeros as poles, counting multiplicity. 
Replacing / by / — w for any w € C shows that / takes every value the 
same number of times, counting multiplicity. In particular the Weierstrass 
p-function on E takes every value twice. 

(c) Compute that l/(2ni) J t+dP z f'( z )dz/f(z) e A. Show that this integral 
is also X^e-E v x(f) x where v x \f) is the order of / at x, meaning that f(z) = 
(z — x) Vx ^g{z) with g(x) ^ 0. Note that v x (f) — 0 except at zeros and poles 
of /, so the sum is finite. Thus parts (b) and (c) combine to show that for any 
nonconstant meromorphic function / on E, 

Mf) = 0 in Z and ^ v x (f)x = 0 in £. 
xeE xeE 

(A hint for this exercise is at the end of the book.) 

1.4.2. Let A = loiTi ® ui 2 Z be a lattice and let p be its associated Weierstrass 
p-function. 

(a) Show that p is even and that p' is yl-periodic. 

(b) For i = 1, 2 show that the function p(z + u>i) — p(z) is some constant c» 
by taking its derivative. Substitute z = —oJi/2 to show that a — 0. Conclude 
that p is yl-periodic. 

1.4.3. Let A = Ai. The derivative p' of the corresponding Weierstrass function 
has a triple pole at 0 and simple zeros at 1/2, i/2, (1 + i)/2. Since iA = A it 
follows that 

p(iz) = piA{iz) = i~ 2 p(z) = -p(z). 

Show that in particular p((l + i)/2) = 0. This is a double zero of p since 
also p'((l + i)/2) = 0, making it the only zero of p as a function on C/A. 
Since A = A (complex conjugation) it also follows that p(z) — p(z), so that 
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p(l/2) is real, as is p(i/2) = — p(l/2). Compute some dominant terms of 
p(l/2) and p(i/2) to show that p(l/2) is the positive value. For what m 
does the complex torus C / mA correspond to the elliptic curve with equation 
y 2 =4a;(.T- 1)(.T+1)? 

Reason similarly with A — A^ 3 to find the zeros of the corresponding 
Weierstrass function p and to show that p(l/2) is real. For what m does the 
complex torus C/mA correspond to the elliptic curve with equation y 2 = 
4(x — l){x — Hs)(x — p 2 )? (A hint for this exercise is at the end of the book.) 

1.4.4. For r € % let p T {x) = 4x 3 — g 2 (r)x — g 3 (r) . Show that the discriminant 
of p T equals A(t) up to constant multiple, where A is the cusp form from 
Section 1.1. 

1.4.5. Show that when a 2 — 0 in Proposition 1.4.3 the desired lattice is A = 
mAn 3 for a suitably chosen m. Prove the case 03 = 0 in Proposition 1.4.3 
similarly. 

1.5 Modular curves and moduli spaces 

Recall from Corollary 1.3.3 that two complex elliptic curves C/A and C/A' are 
holomorphically group-isomorphic if and only if mA = A' for some m € C. 
Viewing two such curves as equivalent gives a quotient set of equivalence 
classes of complex elliptic curves. Similarly, view two points r and r' of the 
upper half plane as equivalent if and only if j(t) = r' for some 7 € SL/2(Z), 
and consider the resulting quotient set as well. This section shows that there is 
a bijection from the first quotient set to the second. That is, the equivalence 
classes of points in the upper half plane under the action of the modular 
group are described by the isomorphism classes of complex elliptic curves. 
Theorem 1.5.1 to follow shows considerably more, that the quotients of the 
upper half plane by the various congruence subgroups from Section 1.2 are 
described by the sets of equivalence classes of elliptic curves enhanced by 
corresponding torsion data. 

We begin by describing the relevant torsion data for the congruence sub- 
groups. Let N be a positive integer. An enhanced elliptic curve for r 0 (N) is 
an ordered pair (E, C) where E is a complex elliptic curve and C is a cyclic 
subgroup of E of order N. Two such pairs (E, C) and (E 1 , C) are equivalent, 
written (E, C) - (E\ C), if some isomorphism E ^ E' takes C to C The 
set of equivalence classes is denoted 

So (AO = {enhanced elliptic curves for r 0 (N)}/ ~ . 

An element of Sq(N) is denoted [E, C], the square brackets connoting equiv- 
alence class. 

An enhanced elliptic curve for i~i(iV) is a pair (E, Q) where E is a complex 
elliptic curve and Q is a point of E of order N. (Thus NQ — 0 but nQ ^ 0 
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for 0 < n < N.) Two such pairs (E,Q) and (E',Q') are equivalent if some 
isomorphism E E' takes Q to Q'. The set of equivalence classes is denoted 

Si (AT) = {enhanced elliptic curves for Pi (AT)}/ ~ . 

An element of Si (AT) is denoted [E, Q\. 

An enhanced elliptic curve for T[N) is a pair (E, (P, Q)) where £ is a 
complex elliptic curve and (P, Q) is a pair of points of E that generates the N- 
torsion subgroup E[N] with Weil pairing ejv (P, Q) = e 2ni / N . From Section 1.3 
e7v(P, Q) is some primitive complex Ath root of unity, but this condition is 
more specific. Two such pairs (E,(P,Q)) and (E',(P',Q')) are equivalent if 
some isomorphism E E' takes P to P' and Q to Q'. The set of equivalence 
classes is denoted 

S(AT) = {enhanced elliptic curves for P(A r )}/ ~ . 

An clement of S(N) is denoted [E, (P, Q)]. 

Each of So(A r ), Si(A r ), and S(N) is a space of moduli or moduli space 
of isomorphism classes of complex elliptic curves and A^-torsion data. When 
N = 1 all three moduli spaces reduce to the isomorphism classes of complex 
elliptic curves as described at the beginning of the section. 

For any congruence subgroup P of SL 2 (Z), acting on the upper half 
plane % from the left, the modular curve Y(r) is defined as the quotient 
space of orbits under P, 

Y(r) = r\n = {Pr : T e n}. 

The modular curves for r 0 (N), Pi(A r ), and P(AT) are denoted 

y 0 (n) = r 0 (N)\u, Ki(jv) = ri(jv)\w, y(n) = r(N)\n. 

Chapter 2 will show that modular curves are Riemann surfaces and they 
can be compactified. Compact Riemann surfaces are described by polynomial 
equations. Thus modular curves have complex analytic and algebraic charac- 
terizations like complex elliptic curves. Chapter 7 will show how the moduli 
spaces arise from a single elliptic curve, and it will further show that the poly- 
nomials describing io(A^) and Y\(N) have rational coefficients. For now we 
continue to work complex analytically and show that the moduli spaces map 
bijectivcly to noncompactified modular curves. Recall the lattice A T — rZffiZ 
for t £ H. Proving the following theorem amounts to checking that the torsion 
data defining the moduli spaces match the conditions defining the congruence 
subgroups. 

Theorem 1.5.1. Let N be a positive integer. 
(a) The moduli space for Pq{N) is 



S 0 (N) = {IE t ,(1/N + A t )]:t€H}. 
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Two points [E T , (1/N + A T )\ and [E r >, (1/N + A T i)} are equal if and only 
if r 0 (N)r — r 0 (N)r'. Thus there is a bijection 

Vo : S 0 (N) Y 0 (N), [C/A T , (1/N + A T )\ ^ r 0 (N)r. 

(b) The moduli space for Ti(N) is 

S 1 (N) = {[E T ,l/N + A T ]:reH}. 

Two points [E T , 1/N + A T ] and [E T i, 1/N + A T >] are equal if and only if 
Ti(N)t — Ti(N)t / . Thus there is a bijection 

: Si(JV) Yi (N) , [C/A T , 1/N + A T ] ^ A(JV)t. 

(c) The moduli space for T(N) is 

S(N) - {[C/A T , (t/N + A T , 1/N + A T )\ : r e %}. 

Two points [C/A t ,(t/N+A t ,1/N+A t )], [C/A t ,,(t' /N+A T ,,1/N+A T ,)] 
are equal if and only if T(N)t = r(N)r' . Thus there is a bijection 

V> : S(N) Y(N), [C/A T , (t/N + A T , 1/N + A T )\ ^ T(N)t. 

Proof. Parts (a) and (c) are left as Exercise 1.5.1. For (b), take any point 
[E, Q] of Si (AT). Since E is isomorphic to C/A T > for some t' £ T-L as discussed 
in Section 1.3, we may take E = C/A T >. Thus Q = (ct 1 + d)/N + A T , for 
some c, d £ Z. Then gcd(c, d, N) = 1 because the order of Q is exactly N, i.e., 
ad — be — kN = 1 for some a, b, and k, and the matrix 7 = [" J] £ M2(Z) 
reduces modulo TV into SL 2 (Z/7VZ). Modifying the entries of 7 modulo TV 
doesn't affect Q, so since SL 2 (Z) surjects to SL 2 (Z/7VZ) we may take 7 = 
[ a c b d ] £ SL 2 (Z). Let r = 7(1-') and let m = ct' + d. Then mr = ar' + b, so 

mA T = m(rZ 0 Z) = (ar' + b)Z ® (ct' + d)Z = r'Z ® Z = A T , 

(using Lemma 1.3.1 for the third equality), and 

( 1 ,\ cr' + d t 
m^-+A T )=^^+A T ,=Q. 

This shows that [E, Q] = [C/A T , 1/N + A T ] where r £ H. 

Suppose two points t,t' £tL satisfy F\(N)t = Ti(N)t' . Thus r = 7(r') 
where 7 = [" J] € Ti(N). Again let m — ct' + d. Then as just shown, 

mA T = A T ,, ml—+A T \= — +A r >. 

But since (c, d) = (0, 1) (mod N) the second equality is now m(l/N + A T ) = 
1/N + A T ,. Thus [C/A T ,1/N + A T ] = [C/A T ,, 1/N + A T ,\. 
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Conversely, suppose [C/A T , 1/N+A T ] = [C/A T ,, 1/N+A T >] with r, r' e H. 
Then for some m G C, mA T = A T i (by Corollary 1.3.3) and m(l/N + A T ) = 
l/N + A T i. By Lemma 1.3.1 the first of these conditions means that 

h d eSL 2 (Z), (l.io) 

so in particular m = or' + d. Now the second condition becomes 

cr' + d A 1 

— + At ' = n + a ^ 

showing that (c,d) = (0,1) (mod N) and 7 e A(-W). Since r = 7(1-') 
by (1.10), it follows that A(iV)r = A(A^)t'. □ 

Specializing to TV = 1, the theorem shows that the space of isomor- 
phism classes of complex elliptic curves parameterizes the modular curve 
Y 0 (l) = Yi(l) = y(l) = SL 2 (Z)\"H as mentioned in Section 1.3 and at 
the beginning of this section. This lets us associate a complex number to 
each isomorphism class. Recall the modular invariant j from Section 1.1, an 
SL 2 (Z)-invariant function on %. Each isomorphism class of complex elliptic 
curves has an associated orbit SL 2 (Z)r e SL 2 (Z)\"H and thus has a well 
defined invariant j(SL 2 (Z)t). This value is also associated to any complex 
elliptic curve E in the isomorphism class and correspondingly denoted j(E). 
The Modularity Theorem states that the elliptic curves with rational j-values 
arise from modular forms, as discussed back in the preface. 

It follows from Theorem 1.5.1 that maps of the modular curves Y 0 (N), 
Yi(N), and Y(N) give rise to maps of moduli spaces. For example, the natural 
map from Y\(N) to Yy(N) taking orbits r\(N)r to Io(7V)t becomes the map 
from Si (TV) to So(N) taking equivalence classes [E,Q] to [E, (Q)}, forgetting 
the generator but keeping the group it generates. For another example, since 
Ei(N) is a normal subgroup of ro(N) the quotient group acts on Y±(N) and 
therefore on S\(N). The action works out to 

A (iV) 7 :[E,Q]^[E, dQ] where 7 = [ « J}] (mod N) . 

This self-map will recur in Chapter 5 and thereafter as a Hecke operator of 
the sort mentioned in the preface. For details of these examples and others 
see Exercises 1.5.2 through 1.5.6. The hybrid maps from Exercise 1.5.6, 

[E,C,Q]^[E,Q], [E,C,Q}^[E/C 7 Q + C], (1.11) 

where C is a cyclic subgroup of prime order p and Q is a point of order TV 
and C n (Q) = {0e}, combine to describe the other Hecke operator to be 
introduced in Chapter 5, 

[E,Q]^^[E/C,Q + C\. 

c 



TUT 




t' 


m 


= 7 


1 



for some 7 = 
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The bijections between moduli spaces and modular curves give more ex- 
amples of modular forms. The idea is that a class of functions of enhanced 
elliptic curves corresponds to the weight- k invariant functions on the upper 
half plane. Let k be an integer and let r be one of r 0 (N), r 1 (N), or r(N). 
A complex-valued function F of the enhanced elliptic curves for r is degree-k 
homogeneous with respect to r if for every nonzero complex number m, 



F(C/mA,mC) ) (m- k F(C/A,C) if r = r 0 (N), 

F(C/mA, mQ) > = < m~ k F(C/A,Q) if r = A(iV), (1.12) 

F(C/mA,(mP,mQ)) J [ m - k F(C/A,(P,Q)) if r = r(N). 

Given such a function F, define the corresponding dehomogenized function 
f:U — > C by the rule 



Then / is weight-A: invariant with respect to _T. To see this, let 7 = [jj] ef 
and for any r <G H let m — (ct + (i) -1 . Then, e.g., for _T = i~i(iV), using the 
condition (c, e?) = (0, 1) (mod N) at the third step, 



/( 7 (r)) = F(C/i 7(T) , 1/JV + /1 7(T) ) - F(C/mA T , m(cr + d)/N + mA T ) 
= m- k F(C/A T , 1/N + A T ) = (ct + d) k f(r). 



For instance, the lattice Eisenstein series from Section 1.4 are degree-fc homo- 
geneous with respect to SL 2 (Z), dehomogenizing to Eisenstein series on the 
upper half plane. 

Conversely, let / be wcight-fc invariant with respect to r where r is one of 
r 0 (N), A(iV), or r(N). Then formula (1.13) turns around to define a func- 
tion F on enhanced elliptic curves of the special type (C/A T , (torsion data)) 
in terms of / (Exercise 1.5.7). If two such enhanced elliptic curves are equiv- 
alent, e.g., (C/A T >, l/N + A T t) = (C/mA r ,m/N + mA T ), then r = j(t') and 
m = ct' + d for some 7 = [" € r as in the proof of Theorem 1.5.1. Thus 
f(r) = m k /(r') and so F obeys formula (1.12) for those two points, 



F(C/A T , , 1/N + A T ,) = /(r') = m- k f(r) = m - k F(C/A T , 1/N + A T ). 



Since every enhanced elliptic curve is equivalent to an enhanced elliptic curve 
of the special type, formula (1.12) in its full generality extends F to a degree-fc 
homogeneous function of enhanced elliptic curves for r. 

As an example of the correspondence, let N > 1 and let v = (c v ,d v ) e Z 2 
be a vector whose reduction v modulo N is nonzero. Define a function of 
enhanced elliptic curves 



'f(c/a t ,(i/n + a t }) ifr 

f{r)= (F{C/A T ,1/N + A T ) ifr 

f{c/a t ,{t/n + a t ,i/n + a t )) ar 



r 0 (N), 

A (AO, 

r(N). 



(1.13) 



F$(C/A, (P,Q)) = 



1 

iV 2 " 



p A (c v P + d v Q). 
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(The superscript v is just a label, not an exponent.) Then F$ is degree-2 
homogeneous with respect to r(N) (Exercise 1.5.8(a)). Chapter 4 will use the 
corresponding function 



C V T ■ 



d v 



N 



to construct Eisenstein series of weight 2. Similarly, recall the lattice constants 
92(A) = 60G 4 (yl) and g 3 (A) = 140G 6 (/1), the integer multiples of lattice 
Eisenstein series from Section 1.4, and define functions of enhanced elliptic 
curves for r(N), A (TV), and r 0 (N), 



F£(C/A,(P,Q)) 
F$(C/A,Q) 

F 0 (C/A,C) 



92(A) 



p A (c v P + d v Q), 

p A (dQ), 



93(A) 
92(A) 

F^(C/A, (P,Q)) 
92(A) 
9s(A) 

N-l 



QeC-{o} 

Y,Fi(C/A,dQ) 
d=i 



deZ,d^0 (mod N) 
for suitable P, 

for any generator Q of C. 



Each of these functions is degree-0 homogeneous with respect to its group 
(Exercise 1.5.8(b)). The corresponding weight-0 invariant functions are 



fS(r) 



92(t) 
9s(r) 



C V T ■ 



d v 



f d M 92(r) 



N 



JV-l 



93 (i~) jri 



N 



N-l 



Chapter 7 will use these functions to show how polynomial equations describe 
modular curves. We could have defined the functions with g2(r) g 3 (r) / A(t) 
where we have 32 (t) /<73(t), to avoid poles in H, but the meromorphic functions 
here are normalized more suitably for our later purposes. 



Exercises 



1.5.1. Prove the other two parts of Theorem 1.5.1. (A hint for this exercise 
is at the end of the book.) 
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1.5.2. The containments r(N) C Ji(A') C Jo (AT) give rise to subjections 
Y(N) — > Y^N) and Y^N) — > Y 0 (N) given by r(N)r ^ A(7V)r and 
A(A r )r i ^ r n (N)r. Describe the corresponding maps between moduli spaces. 
(A hint for this exercise is at the end of the book.) 

1.5.3. Since the group containments mentioned in the previous exercises are 
normal, it follows that the quotient r 1 (N)/r(N) ~ Z/NZ (cf. Section 1.2) 
acts on the modular curve Y(N) by multiplication from the left, and similarly 
for r 0 (N) I T 1 (N) = (Z/NZ)* and Yi(N). Describe the corresponding action 
of Z/NZ on the moduli space S(AT), and similarly for (Z/NZ)* and Si (AT). 
(A hint for this exercise is at the end of the book.) 

1.5.4. Let wn = [jvo] GL^Q). Show that wjy normalizes the group 
Jo (AT) and so gives an automorphism r 0 (N)r i->- r n (N)wN(r) of the mod- 
ular curve Y 0 (N). Show that this automorphism is an involution (meaning 
it has order 2) and describe the corresponding automorphism of the moduli 
space So(N). (A hint for this exercise is at the end of the book.) 

1.5.5. Let A^ be a positive integer and let p be prime. Define maps 7Ti,7T2 : 
Y 0 {Np) — ► Y 0 (N) to be 7r 1 (r 0 (A^p)T) = r Q (N)r and n 2 (r 0 (Np)T) = 
Jo(A r )(pr). How do the corresponding maps ni,n 2 : S 0 (Np) — > S 0 (N) act 
on equivalence classes \E : C]1 Same question, but for J\(A?p) and A(A^) and 
Si(Np) and Si(N), and then again for r(Np), etc. 

1.5.6. Let Af be a positive integer and let p be prime. Define congruence 
subgroups of SL 2 (Z), 



J» = 



a b 
c d 



€ SL 2 (Z) : 



a b 

c d 



* 0 



(mod p) 



and 

r°(N, P ) = r 1 (N)nr°(p), 

and define a corresponding modular curve, 

Y 1 °(N,p) = Y(r?(N,p)). 

An enhanced elliptic curve for J^A^p) is an ordered triple (E,C,Q) where 
J? is a complex elliptic curve, C is a cyclic subgroup of E of order p, and Q is 
a point of E of order A^ such that CC\(Q) = {Oe}- Two such triples (E, C, Q) 
and (E',C',Q') are equivalent if some isomorphism E E' takes C to C 
and Q to Q' . The moduli space for ri(N,p) is the set of equivalence classes, 

S°(N,p) = {enhanced elliptic curves for r°(N, p)}/ ~ . 

An clement of S?(AT,p) is denoted [E, C, Q]. 

(a) By the proof of Theorem 1.5.1(b), every element of Sl(N,p) takes 
the form [C/A T >,C,l/N + A T >]. Show that C has a generator of the form 
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( T ' +j)/p+ At' or 1/-P+ A r >, this last possibility arising only if p\ N. (This 
idea will be elaborated in Section 5.2.) Show that therefore C and Q can be 
assumed to take the form 

/ar' + b A \ ^ cr' + d 

C=^— r + A T ,y Q=^^ + A Tl , 

Let 7 = [ a c b d ] and let r = 7 (r'). Show that [C/A T >, C, 1/N + A T >] = 
[C/A T , (r/p + At), 1/N + A T \. (A hint for this exercise is at the end of the 
book.) 

(b) Part (a) shows that the moduli space for r®(N,p) is 

S?(JV,p) - {[C/A T , (t/p + At), 1/N + A T ] : r G H}. 

Show that two points of Sj (N, p) corresponding to r, t' £ W are equal if and 
only if r®(N,p)r = r®(N,p)T' , and thus there is a bijection 

V? : S°(iV,p) — ► If (JV,p), [C/i r , (r/p + A T ), 1/N + A T ] ^ r?(N,p)r. 

(c) Show that the maps r?(N,p)r i— > A(iV)T and r?(N,p)r *—> r^N^/p 
from Y®(N,p) to Yi(iV) are well defined. Show that the corresponding maps 
from Si(N,p) to Si (AT) are as described in (1.11). 

1.5.7. Show that if two enhanced elliptic curves for r of the special type 
(C/A T , (torsion data)) are equal then r' = 7(7-) for some 7 = [J J] e T. Show 
that consequently if f :% — 5- C is wcight-fc invariant with respect to r then 
formula (1.13) gives a well defined function F on enhanced elliptic curves of 
the special type. 

1.5.8. (a) Show that the function F% from the section is degree-2 homogeneous 
with respect to r(N). 

(b) Similarly, show that the functions Fq , Fq, and Fo are degree-0 homo- 
geneous with respect to r(N), A (TV), and r 0 (N) respectively. 



a b 
c d 



G SL 2 (Z). 
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Modular Curves as Riemann Surfaces 



For any congruence subgroup r of SL 2 (Z) the corresponding modular curve 
has been defined as the quotient space r\H, the set of orbits 

Y(r) = {/v ■. t e H}. 

This chapter shows that Y(r) can be made into a Riemann surface that can 
be compactified. The resulting compact Riemann surface is denoted X(r). 

Related reading: [FK80] is one of many good books about Riemann sur- 
faces. 



2.1 Topology 

The upper half plane % inherits the Euclidean topology as a subspace of R 2 . 
The natural surjection 

it : H — > Y(r), tt(t) = JV 

gives Y(r) the quotient topology, meaning a subset of Y(r) is open if its 
inverse image under tt in % is open. This makes 7r an open mapping (Exer- 
cise 2.1.1). And the following equivalence holds (Exercise 2.1.2): 

7r(f/i) n tt(U 2 ) = 0 in Y(r) r(Ui) n U 2 = 0 in H. (2.1) 

Since % is connected and 7r is continuous, the quotient Y(r) is also connected. 

This section will show that Y(r) is Hausdorff, meaning distinct points 
have disjoint neighborhoods. The key to this, and to putting coordinate charts 
on Y(r) in the next section, is the idea that any two points in H have neigh- 
borhoods small enough that every SL 2 (Z) transformation taking one point 
away from the other also takes its neighborhood away from the other's. In 
the parlance of topology, the action of SL 2 (Z) on H is properly discontinuous. 
The precise statement is 
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Proposition 2.1.1. Let t\,t 2 G H be given. Then there exist neighborhoods 
Ui of n and U 2 of t 2 in % with the property 

for all 7 e SL 2 (Z), if ^{Ux) C\U 2 ^% then 7 (n) = t 2 . 

Note that n and r 2 in the proposition can be equal. 

With the transformations SL 2 (Z) viewed as congruent motions, the upper 
half plane H has a geometry concentrated down near the real axis, essentially 
the same as how the hyperbolic geometry of the disk (perhaps familiar to the 
reader from M. C. Escher's artwork) is concentrated out near the rim. This 
idea, which will become clearer in Section 2.3, motivates condition (2.2) in 
the following proof. 

Proof. Let U[ be any neighborhood of n with compact closure in H and 
similarly for U 2 . Consider the intersection 7(f/() fl U 2 for 7 = [" J] G SL 2 (Z). 
For all but finitely many integer pairs (c, d) with gcd(c, rf) = 1 the condition 

sup{Im(7(r)) : 7 G SL 2 (Z) has bottom row (c, d), r <G 

< inf{Im(r) :t£^} ( ' ^ 

holds (Exercise 2.1.3(a)), making j(U[) D £7 2 empty. Also, for every integer 
pair (c, d) with gcd(c, d) = 1, the matrices 7 € SL 2 (Z) with bottom row (c, d) 
are 

""1 fc" 



0 1 



a 6 
c d 



fc e Z 



where (a, 6) is any particular pair such that ad — be = 1 (Exercise 2.1.3(b)). 
Thus j(U[) n U' 2 = {[ a c b d ] (U[) + k) n U' 2 is empty for all but finitely many 7 
with bottom row (c, d). Combining these two results shows that j(U[) inter- 
sects U 2 for only finitely many 7 G SL 2 (Z). (See Exercise 2.1.3(c) for a less 
elementary way to prove this. The exercise establishes some facts that will 
make Proposition 2.2.2, to follow, clearer as well.) 

Let F = {7 G SL 2 (Z) : j(U[) n t/ 2 ^ 0, 7(7-1) ^ r 2 }, a finite set. For each 
7 G F there exist disjoint neighborhoods U\ n of 7(n) and [/ 2j7 of t 2 in H. 
Define 



Ui = U[C\ fl 7 1 (?7i, 7 ) , a neighborhood of n in "H, 




^2 = t/ 2 n f) c/ 2 . 7 I j a neighborhood of r 2 in W. 

\yeF J 

Take any 7 G SL 2 (Z) such that y(Ui) n U 2 ^ 0. To show 7(7-1) = t 2 it 
suffices to show 7 ^ F. But if 7 G F, then 7~ 1 ([/i !7 ) D U\ and ?7 2 , 7 D f7 2 , so 
Ui l7 n f7 2 , 7 3 7(^1) n C/ 2 7^ 0, a contradiction since f/i i7 and C/ 2j7 are disjoint. 
This completes the proof. □ 
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Corollary 2.1.2. For any congruence subgroup r of SL 2 (Z), the modular 
curve Y(r) is Hausdorff. 

Proof. Let 7t(ti) and 7r(r 2 ) be distinct points in Y(r). Take neighborhoods 
U\ of ri and £/ 2 of t 2 as in Proposition 2.1.1. Since j(ti) ^ r 2 for all 7 e T, 
the proposition says that r(U\) n ?7 2 = 0 in H, and so equivalence (2.1) shows 
that n(Ui) and 7r(/7 2 ) are disjoint supersets of 7t(ti) and 7t(t2) in Y(r). They 
are neighborhoods since tt is an open mapping. □ 

It needs noting that Y(r) is small enough to be a Ricmann surface. A 
Ricmann surface is a 1-dimensional connected complex manifold, and by def- 
inition the set underlying a manifold must have a second countable topology, 
meaning the topology has a countable basis. This is certainly true for Eu- 
clidean space and therefore for the quotient Y(r). For more on this terminol- 
ogy, see any point set topology text, e.g., [MunOO]. 

Exercises 

2.1.1. Show that giving Y(r) the quotient topology makes the natural sur- 
jection n an open mapping. 

2.1.2. Establish equivalence (2.1). 

2.1.3. (a) Establish inequality (2.2). (A hint for this exercise is at the end of 
the book.) 

(b) Show that the set of matrices in SL 2 (Z) with bottom row (c, d) is 
{ [ 0 1 ] [ " d ] : ^GZ} where (a, b) is any particular pair such that ad—bc= 1 . 

(c) The first part of the proof of Proposition 2.1.1 can also be argued by 
studying the particular point i. Show that in the group SL 2 (R), the subgroup 
of elements fixing i is the special orthogonal group S0 2 (R). Note that the 
group SL 2 (R) takes i to any point t = x + iy e H since the function s : 
U — > SL 2 (R) given by 

s(r) = 4= 

satisfies s(r)i = r. (So now the upper half plane has a purely group the- 
oretic description, H = SL 2 (R)/S0 2 (R)-) Use these facts to show that 
for any 7 G SL 2 (R) and ei,e 2 G H, the conditions 7(ei) = e 2 and 
7 € s ( e 2)S0 2 (R)s(e!) -1 are equivalent. Let e\ range over the compact closure 
of the set U[ in the proof, and similarly for e 2 , to show that i(U[) intersects 
U' 2 only for 7 in a certain compact group. Since 7 must also lie in the discrete 
group SL 2 (Z), conclude that only finitely many such 7 exist. 



0 1 
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2.2 Charts 

The next task is to put local coordinates on the modular curve Y(r). This 
means finding for each point 7t(t) e Y(r) a neighborhood U (the symbol U 
is reserved for neighborhoods in %) and a homcomorphism tp : U — > V C 
C such that the transition maps between the local coordinate systems are 
holomorphic. 

At a point 7t(t) where t £ H is fixed only by the identity transformation 
in r, i.e., only by the matrices F n {±/}, this is easy: a small enough neigh- 
borhood U of t in H is homeomorphic under n to its image ir(U) in Y(r), as 
Proposition 2.1.1 guarantees such a neighborhood with no F-equivalent points 
(Exercise 2.2.1). So a local inverse : n(U) — > U could serve as the local 
coordinate map. 

But a point 7t(t) where r has a nontrivial group of fixing transformations 
in r (its isotropy subgroup) poses more of a problem. For example, let F = 
SL 2 (Z) and let r = i, a fixed point under 7 = [° "J] . In the small 7 acts as 
180-degree rotation about i (Exercise 2.2.2), so any neighborhood U of i in H 
contains pairs of 7-equivalent points (see Figure 2.1) and thus can't biject to a 
neighborhood of n(i) in SL2(Z)\"H — projecting to the modular curve identifies 
the two halves of the neighborhood. This illustrates that the points to worry 
about satisfy 

Definition 2.2.1. Let r be a congruence subgroup o/SL 2 (Z). For each point 
t G H let F r denote the isotropy subgroup of t, i.e., the r-jlxing subgroup 

ofr, 

r T = { 1 er-. 7 (r) - T }. 

A point t € H is an elliptic point for r (or of T) if T T is nontrivial as 
a group of transformations, that is, if the containment {±I}T T D {±7} of 
matrix groups is proper. The corresponding point 7t(t) e Y(r) is also called 
elliptic. 




Figure 2.1. Pairwise equivalence about i 
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Note that F T also fixes the point r G — "H since -T C SL 2 (R). ("Elliptic 
point" is unrelated to "elliptic curve" from Chapter 1 other than each having 
a distant connection to actual ellipses.) 

The next result will be established in the following section. 

Proposition 2.2.2. Let F be a congruence subgroup o/SL2(Z). For each el- 
liptic point t of r the isotropy subgroup T T is finite cyclic. 

Thus each point r € H has an associated positive integer, 



This h T is called the period of t, with h T > 1 only for the elliptic points. If 
t e H and 7 e SL 2 (Z) then the period of j(t) under -fTj^ 1 is the same as the 
period of r under J 1 (Exercise 2.2.3). In particular, /i T depends only on IV, 
making the period well defined on Y(r), and if r is normal in SL 2 (Z) then 
all points of Y(r) over a point of F(SL 2 (Z)) have the same period. The space 
Y(P) depends on f as a group of transformations acting on T-L, and —I acts 
trivially, so defining the period as we did rather than simply taking h T = \r r \ 
is natural. The definition correctly counts the r-fixing transformations. 

To put coordinates on Y(r) about a point 7t(t), first use the map 
5 T = [ 1 Z^] <= GL 2 (C) (the group of invertible 2-by-2 matrices with com- 
plex entries) to take t to 0 and r to 00. The isotropy subgroup of 0 in the 
conjugated transformation group, (<5 T {±/}P5~ 1 )o/{±/} ) is the conjugate of 
the isotropy subgroup of r, S T ({±I}r T /{±I})6~ 1 , and therefore is cyclic of 
order h T as a group of transformations by Proposition 2.2.2. Since this group 
of fractional linear transformations fixes 0 and 00, it consists of maps of the 
form z i->- az, and since the group is finite cyclic these must be the rotations 
through angular multiples of 2ir/h T about the origin. Thus S T is "straighten- 
ing" neighborhoods of t to neighborhoods of the origin in the sense that after 
the map, equivalent points are spaced apart by fixed angles. This suggests 
(as Figure 2.1 has illustrated with h T — 2) that a coordinate neighborhood 
of 7t(t) in Y(r) should be, roughly, the 7r-image of a circular sector through 
angle 2ir/ 'h T about r in H, and that the identifying action of ir is essentially 
the wrapping action of the h T th power map, taking the sector to a disk. 

To write this down precisely and check that it works, start with another 
consequence of Proposition 2.1.1 (Exercise 2.2.4), 

Corollary 2.2.3. Let r be a congruence subgroup o/SL 2 (Z). Each point r G 
H has a neighborhood U in % such that 



K = |{±/}JV/{±/}| 



( 



\r T \/2 if-/er T) 
\r T \ tf-Jgr T . 



for all 7 e r, if-){U) n U ^ 0 then 7 e i~V • 



Such a neighborhood has no elliptic points except possibly r. 
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Figure 2.2. Local coordinates at an elliptic point 



Now given any point tt(t) G Y(r), take a neighborhood U as in the corol- 
lary. Define ip : U — > C to be ip — p o 5 where S — 5 T and p is the power 
function p(z) = z h with h = h T as above. Thus ip(r') = (S(T')) h acts as the 
straightening map 5 followed by the /i-fold wrapping p. (See Figure 2.2.) Let 
V = ip(U), an open subset of C by the Open Mapping Theorem from complex 
analysis. Since the projection ir and the wrapping ip identify the same points 
of U, there should exist an equivalence between the images of U under the 
two mappings. To confirm this, consider the situation 

U ir(U) C Y(r), U A V C C 

and note that for any points t\,T2 G U, 

7t(ti) = 7t(t 2 ) ri G Ft2 T\ G _T t t 2 by Corollary 2.2.3 

<=► <5(n) G (W" 1 )^)) - 4(^(r 2 )) for some d, 

where pt = e 27Tl / h , since STrS^ 1 is a cyclic transformation group of h rota- 
tions. So 

Tr(ri) = tt(t 2 ) <=► (<5(r 1 ))' 1 = (S(r 2 )) h <=► V(n) = V(t 2 ), 

as desired. Thus there exists an injection 95 : 7r({7) — ► 1/ such that the 
diagram 




tt(U) > V 

commutes. Also, <p surjects because ip surjects by definition of V. The map 
ip is the local coordinate. The coordinate neighborhood about 7t(t) in Y(r) 
is tt(U) and the map ip : n(U) — > V is a homeomorphism (Exercise 2.2.5). 
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In sum, U is a neighborhood with no elliptic point except possibly r; 
ip : U — > V is the composition ip = p o 8 that matches the identifying action 
of 7T by wrapping U onto itself as prescribed by the period of r; and the local 
coordinate ip : n(U) — > V, defined by the condition ip o n = ip, transfers the 
identified image of U in the modular curve back to the wrapped image of U 
in the complex plane. 

We need to check that the transition maps between coordinate charts are 
holomorphic. That is, given overlapping ir(Ui) and 7r(£/ 2 ), we need to check 
(fi2A, the restriction of tp 2 ° (fii 1 to tpi{Tr{U\) n 7r([/ 2 )). Let = (fii(n(Ui) n 
n(U2)) and T/2,1 = ^2(^(^1) H 7r(/7 2 )) and write the commutative diagram 



7r(J7i)n7r(CA 2 ) 




For each x G 7r(£/i) n 7r([/ 2 ) it suffices to check holomorphy in some neigh- 
borhood of tfi(x) in Vi.2- Write x = 7t(ti) = 7r(r 2 ) with ri G f/i, t 2 G C/ 2 , 
and r 2 = 7(ri) for some 7 e f. Let f/i^ = U\ fl 7~ 1 ([/ 2 ). Then the projec- 
tion 7r([/i, 2 ) is a neighborhood of x in 7r(f/i) n 7r(J7 2 ) and so <pi(7r(t r i j2 )) is a 
corresponding neighborhood of <pi(x) in Vi. 2 . 

Assume first that tpi(x) — 0, i.e., the first straightening map is Si = S Tl . 
Then an input point q = fi(x') to Lp 2 ,i in this neighborhood takes the form 

q = <Mtt(t')) = ^i(r') = (^i(t'))' 11 for some r' G L7i, 2 

where /ii is the period of n. Letting f 2 € i7 2 be the point such that ^2(^2) = 0 
and letting /i 2 be its period, the corresponding output is 

¥?2(z') = <p 2 (7r(7(V))) = '0 2 (7(r')) which is defined since j(t') G U2 

= (S2(j(T>))) h > = ((S 2l S^)(Si(T>))) h z 

= ((<527<yr 1 )(« 1/fci )) fca - 

This calculation shows that the only case where the transition map might 
not be holomorphic is when hi > 1, meaning n is elliptic and hence so 
is t 2 = 7(71) with the same period. Recall that U2 contains at most one 
elliptic point by construction and then the local coordinate takes it to 0. So 
when hi > 1 the point r 2 is the point f 2 G U2 mentioned above, the second 
straightening map is 82 = S T2 , and /i 2 = hi . Thus 

„ S l 1 7 <?2 _ 7 _ <5 2 

Ol >Tl I >T 2 I >0, OO I >Tl\ >T 2 I >OQ, 

showing that 827 8 ^ 1 = [0/3] f° r some nonzero a, (3 G C. The formula for 
</?2,i becomes 



52 



2 Modular Curves as Riemann Surfaces 



q^([% O 0](q 1/h )) h = (<*/0) h q 

and the map is clearly holomorphic. 

So far the argument assumes that <pi(x) = 0. But it also covers the case 
f2(x) = 0 since the inverse of a holomorphic bijection is again holomorphic. 
And in general, </?2,i is a composite <P2,3 0< P3,i where ips '■ ""(^3) — > V3 takes x 
to 0, so in fact the argument suffices for all cases. 

Exercises 

2.2.1. Let t e H be fixed only by the identity transformation in r. Use 
Proposition 2.1.1 to show that some neighborhood U of r is homeomorphic 
to its image in Y(r). 

2.2.2. Compute [ \ [ \ ~q] [ \ ~-] 1 and explain how this shows that the 
map 7(7-) = — 1/t acts in the small as 180-degree rotation about i. 

2.2.3. Show that if r e "H and 7 g SL 2 (Z) then the period of 7(r) un- 
der 7-T7 -1 is the same as the period of r under J\ 

2.2.4. Prove Corollary 2.2.3. 

2.2.5. Explain why ip : ir(U) — > V as defined in the section is a homeomor- 
phism. (A hint for this exercise is at the end of the book.) 

2.3 Elliptic points 

It remains to prove Proposition 2.2.2, that for any congruence subgroup r 
of SL 2 (Z) each elliptic point r of r has finite cyclic isotropy subgroup F T . 
The process of doing so will develop a much clearer picture of Y(r) and show 
that it has only finitely many elliptic points. 

The simplest case is Y(l) = SL 2 (Z)\%. The next two lemmas will show 
that Y(l) can essentially be identified with the set (Figure 2.3) 

V= {t g H : |Rc(r)| < 1/2, |r| > 1}. 

By Section 1.5, this set V also essentially represents the equivalence classes of 
complex elliptic curves under isomorphism, the point r S V representing the 
class of C/A T where A T = tZ © Z. 

Lemma 2.3.1. The map n : V — ► Y(l) surjects, where tt is the natural 
projection 7r(r) = SL 2 (Z)t. 
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Figure 2.3. The fundamental domain for SL2(Z) 



Proof. Given r € % it suffices to show that r is SL 2 (Z)-equivalent to some 
point in T>. Repeatedly apply one of [ o ± i ] : t l_ t ± 1 to translate r into the 
vertical strip {|Re(r)| < 1/2}, and replace r by this transform. Now if t ^ X> 
then \t\ < 1 and so Im(— 1/r) = Im(— f/\r\ 2 ) = Im(r/|r| 2 ) > Im(r); replace 
t by [? ~o] ( r ) = — l/ r an d repeat the process. Since there are only finitely 
many integer pairs (c, d) such that |cr + d| < 1 (because there are only finitely 
many lattice points inside a disk), the formula 



Im(7r) 



Im(r) 

\cT + d\ 2 



for 7 : 



a b 

c d 



G SL 2 (Z) 



shows that only finitely many transforms of r have larger imaginary part. 
Therefore the algorithm terminates with some t £ V. □ 

The surjection it : T> — > ^(1) isn't injective. The translation [J \] : 
t i \ t ~\~ \ identifies the two boundary half-lines, and the inversion [ \ ~ \ ] : 
t i \ —1/t identifies the two halves of the boundary arc. But these boundary 
identifications are the only ones: 

Lemma 2.3.2. Suppose t\ and r 2 are distinct points in V and that r 2 = 7T1 
for some 7 e SL 2 (Z). Then either 

(1) Re(ri) = ±1/2 and t 2 = n =F 1, or 

(2) |n| = 1 andr 2 = -1/n. 

Proof. (Sketch — Exercise 2.3.1 asks for details.) Assume Im(r 2 ) > Im(n) by 
symmetry. Let 7 = [ a c h d \ - Then |cri + d| 2 < 1 since Im(r 2 ) > Im(n), and 
Im(n) > V3/2 since ti eP. So 

|c|v / 3/2 < |c|Im(n) = |Im(cTi + d)\ < |cn + d| < 1, 



and since c e Z this shows that |c| € {0, 1}. 

If c = 0 then 7 = ± [J \ \ and Re(r 2 ) = Re(ri) + b, forcing 
holds. 



1 and (1) 
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If |c| = 1 then the condition \ct\ + d\ 2 < 1 becomes |n ± <i| 2 < 1, or 
(Re(n) ± df + (Mtx)) 2 < 1, implying (Re(n) ± df < 1 - (Imfr!)) 2 < 
1 - 3/4 = 1/4, so |Re(n) ± d| < 1/2, forcing |d| < 1. 

If |c| = 1 and |d| = 1 then in the preceding calculation all inequalities 
must be equalities. It follows that Im(ri) = \/3/2 and |Re(ri) ± 1| = 1/2, so 
Re(n) = ±1/2 and both (1) and (2) hold. 

If |c| = 1 and d = 0 then the condition \ct\ + d\ < 1 becomes |n| < 1, so in 
fact |ti| — 1 (since t\ € 2?) and Im(r 2 ) = Im(ri). So also |t"2 | = 1 by symmetry 
since now n and r 2 have the same conditions on their imaginary parts and on 
the c-entries of the matrices transforming each to the other. Thus t\ and t 2 
have the same absolute value and the same imaginary part but are distinct, 
forcing their real parts to be opposites and (2) holds. □ 

So with suitable boundary identification the set I? is a model for Y(l) = 
SL2(Z)\"H, also called a fundamental domain for SL 2 (Z). Topologically, T> 
modulo the identification is a plane. Figure 2.4 shows some SL 2 (Z)-translates 
of T>. The whole configuration repeats horizontally with period 1 under iter- 
ations of [q \~\ :th>t±1. The figure shows how the SL 2 (Z)-translates of 
a point r e W can cluster only down toward the real axis, perhaps giving a 
more intuitive understanding of Proposition 2.1.1. 




Figure 2.4. Some SL2(Z)-translates of V 
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Returning to elliptic points, suppose r e 77 is fixed by a nontrivial trans- 
formation € SL 2 (Z). Then ar + 6 = cr + rfr; solving for r with the 
quadratic equation (c = 0 is impossible since r ^ Q) and remembering that 
r£?i shows that |a + d| < 2 (Exercise 2.3.2). Thus the characteristic polyno- 
mial of 7 is x 2 + 1 or i 2 ±x+l. Since 7 satisfies its characteristic polynomial, 
one of 7 4 = 7, 7 s = J, 7 s = 7 holds, and 7 has order 1, 2, 3, 4, or 6 as a 
matrix. Orders 1 and 2 give the identity transformation (Exercise 2.3.3). So 
the following proposition describes all nontrivial fixing transformations. 

Proposition 2.3.3. Let 7 e SL 2 (Z). 

(a) 1/7 has order 3 then 7 is conjugate to _ 1 ] ±1 in SL 2 (Z). 

(b) 1/7 /ias order ^ i/ien 7 is conjugate to [1 ^o]* 1 * n SL 2 (Z). 

(c) 7/7 has order 6 then 7 is conjugate to [" _ 1 ] ±1 in SL 2 (Z). 

Proof, (c) Since 7 s — 7 the lattice 7 = Z 2 of integral column vectors is a 
module over the ring Z[/i 6 ] where /it 6 = e 27 ™/ 6 , defining the scalar-by- vector 
product (a + bfi 6 ) ■ v for a, b e Z and w e 7 as the matrix-by-vector product 
(al + bj)v. 

The ring Z[^ 6 ] is known to be a principal ideal domain and 7 is finitely 
generated over it. The structure theorem for modules over a principal ideal 
domain therefore says that 7 is Z [/^-isomorphic to a sum Z[fio\/Ik where 
the Ik are ideals. As an Abclian group, 7 is free of rank 2. Every nonzero 7^ 
of Z[fis] has rank 2 as an Abelian group, making the quotient Z[/t 6 ]/7fe a 
torsion group, so no such terms appear in the sum. Only one free summand 
appears, for otherwise the sum would be too big as an Abelian group. Thus 
there is a Z[/t 6 ]-module isomorphism 0 7 : Z[/i 6 ] — > 7. 

Let u = </> 7 (l) and v — </> 7 (/i@) and let [u v] denote the matrix with columns 
u and v. Then 7 = Zu + Zv so det[n v] € {±1} (Exercise 2.3.4(a)). Compute 
that ju = hq ■ — (f) 7 (^e) — v, and similarly — fiQ ■ <p^(no) = 0 7 (/i|) = 

<f>f(— 1 + Ho) = — u + v. Thus 

-f[u v] = [v -u + v] — [u v] [" ~{] , so 7 = [u v] [° ~{] [u v}' 1 , 

and 

-y[v u] = [-u + v v] = [v u] [ _H] , so 7 = [v u] [° ~{] 1 [v u}^ 1 . 

One of [u v], [v u] is in SL 2 (Z), proving (1). 

Parts (a) and (b) are Exercise 2.3.4(b). □ 

Now we can understand the elliptic points and their isotropy subgroups. 

Corollary 2.3.4. The elliptic points for SL 2 (Z) are SL 2 (Z)i and SL 2 (Z),u 3 
where ^3 = e 27 ™' 3 . The modular curve Y{1) = SL 2 (Z)\77 has two elliptic 
points. The isotropy subgroups of i and ^3 are 
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SL 2 (Z) < = <[?-J]) and SL 2 (Z) 



f*3 — 



<[?"!]>• 



For each elliptic point t of SL 2 (Z) the isotropy subgroup SL 2 (Z) r is finite 



Proof. The fixed points in H of the matrices in Proposition 2.3.3 are i and ^3. 
The first statement follows (Exercise 2.3.5(a)). The second statement follows 
since i and 113 are not equivalent under SL 2 (Z). The third statement can 
be verified directly (Exercise 2.3.5(a) again), and the fourth statement fol- 
lows since all other isotropy subgroups of order greater than 2 are conjugates 
of SL 2 (Z)i and SL 2 (Z)^ 3 . See Exercise 2.3.5(b) for a more conceptual proof 
of the fourth statement. □ 

Proposition 2.2.2 and a bit more now follow. 

Corollary 2.3.5. Let r be a congruence subgroup o/SL 2 (Z). The modular 
curve Y(r) has finitely many elliptic points. For each elliptic point t of T the 
isotropy subgroup F T is finite cyclic. 

Proof. If SL 2 (Z) = Uj=i Flj tncn tnc elliptic points of Y(r) are a subset of 
Ep = {rjj(i),r-fj(n 3 ) : 1 < j < d}. The second statement is clear since r T 
is a subgroup of SL 2 (Z) T for all r GH. □ 

For a congruence subgroup r, if SL 2 (Z) = (J •{±7}r , 7j then the set U • jjV 
surjects to Y(r) (Exercise 2.3.8), and with suitable boundary identification 
this is a bijection as before. The set need not be a fundamental domain, 
however, since a fundamental domain is also required to be connected. 

Exercises 

2.3.1. Fill in any details as necessary in the proof of Lemma 2.3.2. 

2.3.2. If [ a c b d ] e SL 2 (Z) fixes r e H, show that \a+ d\ < 2. 

2.3.3. Show that if 7 e SL 2 (Z) has order 2 then 7 = -I. 

2.3.4. (a) In the proof of Proposition 2.3.3, why does the condition L = 
Zu + Zv imply det[w v] = ±1? 

(b) Prove the other two parts of Proposition 2.3.3. (Hints for this exercise 
are at the end of the book.) 

2.3.5. (a) Complete the proof of Corollary 2.3.4. (A hint for this exercise is 
at the end of the book.) 

(b) Give a more conceptual proof of the third statement in Corollary 2.3.4 
as follows: The results from Exercise 2.1.3(c), that the isotropy subgroup of i 
in SL 2 (R) is S0 2 (R) and that H = SL 2 (R)/S0 2 (R), show that an element 
s(t) of SL 2 (R) moves r to % and the isotropy subgroup of r correspondingly 
conjugates to a discrete subgroup of S0 2 (R). But since S0 2 (R) is the rota- 
tions of the circle, any such subgroup is cyclic. 



cyclic. 
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2.3.6. In the proof of Corollary 2.3.5, need the 2d points in Y(r) listed in 
the description of E r be distinct? Need the points of E r all be elliptic points 



2.3.7. Prove that there are no elliptic points for the following groups: (a) 
r{N) for N > 1, (b) A (AT) for A^ > 3 (also, find the elliptic points for A (2) 
and A (3) given that each group has one), (c) A (AO for A" divisible by any 
prime p = — 1 (mod 12). (A hint for this exercise is at the end of the book.) 
In the next chapter we will extend the technique of proving Proposition 2.3.3 
to count the elliptic points of A (AO for all N. 

Also, show that if the congruence subgroup r does not contain the negative 
identity matrix —I then r has no elliptic points of period 2. 

2.3.8. Suppose SL 2 (Z) = \Jj{±I}r-fj where f is a congruence subgroup. 
Show that (J • -fjV surjects to Y(r). (A hint for this exercise is at the end of 
the book.) 

2.4 Cusps 

Transferring the fundamental domain V (as pictured in Figure 2.3) to the 
Ricmann sphere via stereographic projection gives a triangle with one vertex 
missing. (The reader should sketch this before continuing.) Clearly the triangle 
should be compactified by adjoining the point at infinity, to be thought of as 
infinitely far up the imaginary axis in the plane or as the north pole on the 
sphere. 

In general, let r be a congruence subgroup of SL 2 (Z). Chapter 1 defined 
the cusps of r as the /^-equivalence classes of Q U {oo}. This section will 
adjoin cusps with appropriate local coordinate charts to the modular curve 
Y(r), completing Y(r) to a compact Riemann surface denoted X(r). The 
coordinates to be used have already been built into the cusp condition in the 
definition of modular forms. The compactified X(r) is also called a modular 
curve. 

The group GL^ (Q) of 2-by-2 matrices with positive determinant and ra- 
tional entries acts on the set Q U {oo} by the rule 



where this means to take oo to a/c and —d/c to oo if c ^ 0 and to take oo 
to oo if c = 0. As the matrices are nonsingular no 0/0 expressions will arise. 
The subgroup SL 2 (Z) acts transitively since any rational number takes the 
form s — a/c with a, c € Z and gcd(a, c) — 1, meaning that ad — be — 1 for 
some b,d £ Z, and [" J] (oo) = s. (Note how this geometrically motivates 
part of Exercise 1.2.11(b), that every 7 £ GL^ (Q) factors as 7 = cry', with 
a £ SL 2 (Z) and 7' = [J t]- if the matrix 7 £ GL^Q) doesn't already have 



of n 
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lower left entry 0 then it takes oo to some rational number s. Left multiplying 
by a suitable a -1 € SL 2 (Z) takes s back to oo, meaning the product 7' = oT 1 ^ 
fixes 00 and therefore has lower left entry 0 as desired.) If a € SL 2 (Z) takes 00 
to a rational number s then a transforms the fundamental domain T> to a 
region that tapers to a cusp at s (cf. Figure 2.4), just as T> itself tapers to a 
cusp at 00 on the Riemann sphere. The isotropy subgroup of 00 in SL 2 (Z) is 
the translations, 



Let r be a congruence subgroup of SL 2 (Z). To compactify the modular 
curve Y(r) = r\H, define H* = "HUQU{oo} and take the extended quotient 



The points Ts in -T\(Q U {00}) are also called the cusps of X(T). For the 
congruence subgroups r 0 (N), i~i (N) , and T(N) we write X 0 (N), X^N), 
and X(N). 

Lemma 2.4.1. The modular curve X(l) = SL 2 (Z)\"H* has one cusp. For any 
congruence subgroup T o/SL 2 (Z) the modular curve X(r) has finitely many 
cusps. 

Proof. Exercise 2.4.1. □ 

The topology on H* consisting of its intersections with open complex disks 
(including disks {z : \z\ > r} U {00}) contains too many points of Q U {00} in 
each neighborhood to make the quotient X(r) Hausdorff. Instead, to put an 
appropriate topology on X(r) start by defining for any M>0a neighborhood 



Adjoin to the usual open sets in H more sets in H* to serve as a base of 
neighborhoods of the cusps, the sets 



and take the resulting topology on H* . Since fractional linear transformations 
are conformal and take circles to circles, if a(oo) e Q then (x(Nm U {00}) 
is a disk tangent to the real axis. (Figure 2.5 shows N\ U {00} and some of 
its SL 2 (Z)-translatcs; note how this quantifies the discussion leading up to 
Definition 1.2.3.) Under this topology each 7 e SL 2 (Z) is a homeomorphism 
of %* . Finally, give X{r) the quotient topology and extend natural projection 
to 7r : T~L* — > X(r). 




: m e Z 



x(r) = r\n* = r(r) u r\(Q u {oo}). 



Mm = {t£«: Im(r) > M}. 



a(M M U {00}) : M > 0, a € SL 2 (Z) 



Proposition 2.4.2. The modular curve X(T) is Hausdorff, connected, and 
compact. 
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Figure 2.5. Neighborhoods of oo and of some rational points 

Proof. The first statement requires distinct points X\,x 2 & X(r) to have 
disjoint neighborhoods. The case x\ — TYi, x 2 = rr 2 with ti,t 2 € "H is 
already established as Corollary 2.1.2. 

Suppose x\ = rsi, x 2 = Ft 2 with s\ e Q U {oo} and t 2 e "H. Then 
si = a(oo) for some a € SL 2 (Z). Let L/ 2 be any neighborhood of t 2 with 
compact closure if. Then the formula 

Im(7(r)) < max{Im(T), 1/Im(r)} for r e H and 7 e SL 2 (Z) 

(Exercise 2.4.2(a)) shows that for M large enough, SL 2 (Z)if n7V M = 0- Let 
Ui = a(Af M U {00}). Then 7r(?7i) and ir(U 2 ) are disjoint (Exercise 2.4.2(b)). 

Suppose x\ = rs\, x 2 = Ts 2 with si,s 2 e Q U {00}. Then si = ai(oo) 
and s 2 = a 2 (oo) for some ai,a 2 € SL 2 (Z). Let ?7i = ai(A/ 2 U {00}) and 
U 2 = a 2 {N 2 U {00}). Then w(Ui) and ir(U 2 ) are disjoint, for if 7Q!i(ti) = 
« 2 (t 2 ) for some 76^ and r l7 r 2 € A/2 then 70:1 takes Ti to t 2 and 
(since A/2 is tessellated by the integer translates of T> and contains no elliptic 
points) therefore must be ±[J™] for some m e Z. Thus 70:1 fixes 00 
and consequently 7(si) = s 2 , contradicting that x\ and x 2 are distinct. This 
completes the proof that X(r) is Hausdorff. 

Suppose H* — 0\ U 0 2 is a disjoint union of open subsets. Intersect with 
the connected set ri to conclude that 0\ D W and so 0 2 C Q U {00}. But 
then 0 2 is not open after all unless it is empty. Thus %* is connected and 
hence so is its continuous image X(r). 

For compactness, first note that the set V* = 2?U{oo} is compact in the W 
topology (Exercise 2.4.3). Since H* = SL 2 (Z)V* = \J j r^(V*) where the 7, 
are coset representatives, X(r) = Uj 7r (7j(^*))- Since each 7^ is continuous 
and 7r is continuous and [SL 2 (Z) : r] is finite, the result follows. □ 

Making X(r) a compact Ricmann surface requires giving it charts. Retain 
the coordinate patches ir(U) and maps ip : 7r(U) — > V from Section 2.2 for 
neighborhoods U C %. For each cusp s <G Q U {00} some 5 — S s € SL 2 (Z) 
takes s to 00. Define the width of s to be 

h s = h., r = |SL 2 (Z) co /(5{±/}r ( 5- 1 ) co |. 
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Figure 2.6. Local coordinates at a cusp 



(The r subscript is suppressed when context makes the group clear.) This 
notion is dual to the period of an elliptic point, being inversely proportional 
to the size of an isotropy subgroup. Recall that the period of an elliptic point is 
the number of sectors of the disk at the point that are identified under isotropy. 
At a cusp, infinitely many sectors come together, sectors most easily seen after 
translating to oo as unit vertical strips, and the width of the cusp is the number 
of such strips are that are distinct under isotropy (see Figure 2.6). The group 
SL2(Z)oo = {±/}([q \ ]) is infinite cyclic as a group of transformations, and so 
the width is characterized by the conditions {±I}(Sr6~ 1 ) 00 — {±/}([J ^]), 
h > 0. The width is finite (Exercise 2.4.4(a)) and independent of S since in fact 
h s = |SL 2 (Z) S /{±J}.T S | (Exercise 2.4.4(b)). If s € Q U {oo} and 7 e SL 2 (Z) 
then the width of j(s) under ^T^ 1 is the same as the width of s under r 
(Exercise 2.4.4(c)). In particular, h s depends only on _Ts, making the width 
well defined on X(r), and if r is normal in SL 2 (Z) then all cusps of X(r) 
have the same width. Now define U = U s = 5~ 1 (J\f2 U {00}) and as before 
define f/iasa composite ip = po 5 where this time p is the /i-periodic wrapping 
map p(z) — e 27rlz / h . Note that tp is exactly the change of variable embedded 
in condition (3) of Definition 1.2.3. Also as before, let V — im(ip), an open 
subset of C, to get 

ip:U — > V, ip(r) = e ^^)l h . 

As with elliptic points, ip mimics the identifying action of it. This time the 
fractional linear transformation 5 straightens neighborhoods of s by making 
identified points differ by a horizontal offset, and then the exponential map p 
wraps the upper half plane into a cylinder which, held to one's eye like a 
telescope, becomes in perspective a disk with 00 at its center. (Again see 
Figure 2.6, noting how the shape of the shaded sector in U motivates the 
term "cusp.") 

To confirm that tp carries out the same identification as tt about s, compute 
that for ti,T2 <G U, 
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tt(ti) = 7t(t 2 ) ri = 7(r 2 ) 5(n) = (5-y5 ^(Jfa)) 

for some 7 e f. But this makes 57<5 _1 a translation since <5(ti) and i5(r 2 ) both 
lie in 7V 2 U {00}. So (fytf- 1 G 5TS- 1 n SL 2 (Z) oc = (£7^-% C ±<[J }]) and 

7r(ri) = 7f(r 2 ) <J(ti) = <5(r 2 ) + m/i for some to € Z 

^(n) = V(r 2 ). 

As in Section 2.2 there exists a bijection <^ : tt(U) — > V such that the diagram 




commutes. As before, the coordinate neighborhood about 7r(r) in Y(r) is tt(U) 
and the coordinate map is ip : tt(U) — > V, a homeomorphism (Exercise 2.4.5). 
Again ip : U — > V is the composition ip = p o 8, and the local coordinate 
ip : n(U) — > V is defined by the condition <p o n = if). 

Checking that the transition maps are holomorphic is similar to the process 
carried out for coordinate patches U\, £7 2 C W in Section 2.2, but now at least 
one patch is a cusp neighborhood. 

Suppose Ui C H has corresponding 81 = 5 Tl € GL 2 (C) where t\ has width 
hi, and suppose [/ 2 = i5 2 ~ 1 (A/ 2 U {00}). As before, for each x <E tt(Ui) n 7r(J7 2 ) 
write x = 7r(fi) = 7t(t 2 ) with fi e f/i, r 2 e ?7 2 , and r 2 = 7(fi) for some 
7 £ T. Let [/ 1-2 = J7i n7 _1 (?7 2 ), a neighborhood of f 1 in H. Then Pi{^{Ui^)) 
is a neighborhood of ipi(x) in Vi j2 = <p>i(n(Ui) fl 7r(J7 2 )). Note that if /ii > 1 
then n <^ C/i ;2 , else the point <5 2 (7(ti)) G A/2 is an elliptic point for r, but 
A/2 contains no elliptic points for SL 2 (Z) (Exercise 2.4.6). So if hi > 1 then 
0 ^ ^1(^(^1,2))- As before, an input point pi(x') to 952,1 in Vi ;2 takes the 
form g = ((5 1 (r'))' H . This time the corresponding output is 

<P2{x') = (^ 2 (7t(7(t'))) = V2(7(^)) = cxp(27ri5 2 7(r')//i2) 
= cxp(2TriS 2l S^ 1 (q 1 / h ^/h 2 ). 

So the only case where the transition map might not be holomorphic is when 
hi > 1 and 0 G 1^1(^(^1,2)), but we have seen that this cannot happen. The 
discussion here also covers the case with the roles of Ui and U2 exchanged 
since the inverse of a holomorphic bijection is again holomorphic. 

Suppose Ui = S^ 1 (Af 2 U {00}) with Si : Si i-» 00 and U 2 = 8 2 ' 1 (Af 2 U 
{00}) with 8 2 : s 2 00. If 7r(f/i) n tt(U 2 ) ^ 0 then 7^ 1 (A/ 2 U {00}) meets 
8 2 ~ 1 (N2 U {00}) for some 7 € r, i.e., (^^f 1 moves some point in A2 U {00} to 
another and therefore must be a translation ± [ J ™]. So 7(si) = 7<5f 1 (oo) = 
±<^~ [J™] (00) = ,s 2 . It follows that hi = h 2 and the transition map takes an 
input point in <£>i(7r(t/i i2 )), 
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q = -01 (t) = exp(27r^i(r)//i), 

to the output point 

ipzilir)) = exp(27ri(527(5f 1 (<5i(t))//i) = exp(27ri(c>i(T) + m)/h) 

_ e 2-Kim/hq^ 

This is clearly holomorphic. 

For any congruence subgroup F of SL 2 (Z) the extended quotient A(F) is 
now a compact Riemann surface. Figure 2.7 summarizes the local coordinate 
structure for future reference. 

Topologically every compact Riemann surface is a g-holed torus for some 
nonnegative integer g called its genus. In particular, complex elliptic curves 
have genus 1. The next chapter will compute the genus of X(r) and study 
the meromorphic functions and differentials on X(r). This will give counting 
results on modular forms as well as being interesting in its own right. 



7T : W* — > X(r) is natural projection. 
U C H* is a neighborhood containing at most one elliptic point or cusp. 
The local coordinate ip : ir(U) ^> V satisfies ip o n = ip 
where ip : U — > V is a composition ip = po S. 


About t 0 e H : 


About s e Q U {oo}: 


The straightening map is z — S(t) 
where 6= \ ~ T ° , S(t 0 ) = 0. 
S(U) is a neighborhood of 0. 


The straightening map is z — S(t) 
where S e SL 2 (Z), S(s) = oo. 
S(U) is a neighborhood of oo. 


The wrapping map is q = p(z) 
where p(z) — z h , p(0) = 0 
with period h = |{±/}r T0 /{±/}|. 
V = p(S(U)) is a neighborhood of 0. 


The wrapping map is q = p(z) 
where p(z) = e ^ lz l } \ p(oo) = 0 

with width h = |SL 2 (Z) s /{±7}r s |. 

V = p(S(U)) is a neighborhood of 0. 



Figure 2.7. Local coordinates on X(r) 



Exercises 

2.4.1. Prove Lemma 2.4.1. 

2.4.2. (a) Justify the formula Im(7(r)) < max{Im(r), l/Im(T)} for r e H 
and 7 € SL 2 (Z) used in the proof of Proposition 2.4.2. 

(b) In the same proof, verify that 7r(i7i) and 7r(C/ 2 ) are disjoint. 
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2.4.3. Show that the set V* = V U {oo} is compact in the H* topology. 

2.4.4. (a) Show that the width h s is finite. (Hints for this exercise are at the 
end of the book.) 

(b) Show that h s = |SL 2 (Z) a /{±7}r s |. 

(c) Show that if s e Q U {oo} and 7 e SL 2 (Z) then the width of 7(3) 
under jTj^ 1 is the same as the width of s under r. 

2.4.5. Explain why ip : ir(U) — > V as defined in the section is a homeomor- 
phism. 

2.4.6. Show that A/2 contains no elliptic points for SL2(Z). (A hint for this 
exercise is at the end of the book.) 

2.4.7. Suppose that J~i and A are congruence subgroups of SL2(Z) and that 
7 is an element of GL^"(Q) (see Exercise 1.2.11) such that 7A7 -1 C A- Show 
that the formula At r 2 j(r) defines a holomorphic map AT(A) — > AT(A)- 

2.5 Modular curves and Modularity 

Recall from Section 1.5 that every complex elliptic curve E has a well defined 
modular invariant j(E). Recall the congruence subgroup Jo (AT) of SL 2 (Z) 
from Section 1.2 and its associated modular curve Xq(N). Complex analyt- 
ically the Modularity Theorem says that all elliptic curves with rational in- 
variants come from such modular curves via holomorphic maps, viewing both 
kinds of curve as compact Riemann surfaces, 

Theorem 2.5.1 (Modularity Theorem, Version X c ). Let E be a com- 
plex elliptic curve with j(E) G Q. Then for some positive integer N there 
exists a surjective holomorphic function of compact Riemann surfaces from 
the modular curve X 0 (N) to the elliptic curve E, 

X 0 {N) — > E. 

The function in the theorem is called a modular parameterization of E. This 
first incarnation of Modularity gives little indication of number theory. The 
statement will be refined in later chapters, with other objects replacing the 
modular curve AT 0 (A^) and with the field Q of rational numbers replacing C, 
as more ideas enter into the book. 
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For any congruence subgroup P of SL 2 (Z), the compactified modular curve 
X(r) is now a Riemann surface. The genus of X(r), its number of elliptic 
points, its number of its cusps, and the meromorphic functions and meromor- 
phic differentials on X(r) are all used by Riemann surface theory to determine 
dimension formulas for the vector spaces Mk(r) and Sk(r). 

Related reading: This chapter is based on Chapters 1 and 2 of [Shi73]. The 
material is also in [Miy89] and (less completely) [Sch74] . 

3.1 The genus 

Let r be a congruence subgroup of SL2(Z). Topologically the compact Rie- 
mann surface X(r) = r\H* is a sphere with g handles for some g £ N. This 
g, the genus of X(r), will figure in the pending dimension formulas for vector 
spaces of modular forms. 

Let / : X — > Y be a nonconstant holomorphic map between compact 
Riemann surfaces. As explained in Section 1.3, / surjects. For any y €Y the 
inverse image f~ 1 (y) is discrete in X since / is nonconstant and X is closed, 
and therefore f~ l {y) is finite since X is compact. In fact the map / has a well 
defined degree d & Z + such that = d for all but finitely many i/eY. 

More precisely, for each point x € X let e x <E Z + be the ramification degree 
of / at x, meaning the multiplicity with which / takes 0 to 0 as a map in 
local coordinates, making / an e x -to-l map about x, cf. the Local Mapping 
Theorem of complex analysis. Then there exists a positive integer d such that 



To see this, let £ = {x e X : e x > 1} denote the exceptional points in X, the 
finite set of points where / is ramified. Let X' = X\£ and Y' = Y\h(£) be 
the Riemann surfaces obtained by removing the exceptional points from X 




e x — d for all y EY. 
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and their images from Y. So X' is all but finitely many points of X, making 
it connected, and similarly for Y' . Let y be a point of Y' . Each x € 
has a neighborhood U x where / is locally bijective. Shrinking these neigh- 
borhoods enough, they can be assumed disjoint and assumed to map to the 
same neighborhood of y. Thus y has a neighborhood V in Y' whose inverse 
image is a finite disjoint union of neighborhoods U x in X' with each restriction 
fx ■ Ux — > V invertiblc. This makes the integer- valued function y M> \f~ 1 (y)\ 
on Y' continuous and therefore constant since Y' is connected. Let d be its 
value; so J2 x ef- 1 (y) e x = d f° r a U V <= Y' . The sum extends continuously to 
all of Y by definition of ramification degree: if y = f(x) then every y' in some 
neighborhood of y is the image of e x points x' near x, each with e x > = 1- 
Thus J2xef- 1 ( y ) e x — \f~ 1 {y')\ — d as claimed, provided that y' is in a small 
enough neighborhood of y to avoid ramification except possibly over y itself. 

Again let / : X — > Y be a nonconstant holomorphic map between com- 
pact Riemann surfaces, of degree d. Let gx and g Y be the genera of X and Y. 
To compute the genus of the modular curve X(r) we will use the Riemann- 
Hurwitz formula, 

2g x ~2 = d(2g Y - 2) + ^ (e x - 1). 

x£X 

This shows, for instance, that there is no nonconstant map to a surface of 
higher genus, that a nonconstant map between surfaces of equal genus is 
unramificd, and that there is no meromorphic function of degree d = 1 on 
any compact Riemann surface of positive genus. We sketch a proof of the 
Riemann- Hurwitz formula for the reader with some background in topology: 
Triangulate Y using Vy vertices including all points of /(£), Ey edges, and 
Fy faces. This lifts under f~ x to a triangulation of X with Ex = dEy edges 
and F x = dF Y faces but only V x = dVy — X^ex^ ~ -0 ver tices because 
of ramification. The Riemann-Hurwitz formula follows because 2 — 2g x = 
Fx — E x + Vx and similarly for Y. 

As a special case of all this, if ri and _T 2 are congruence subgroups 
of SL 2 (Z) with ri C r 2 , then the natural projection of corresponding modular 
curves, 

/ : — ► x(r 2 ), r 1 T^r 2T , 

is a nonconstant holomorphic map between compact Riemann surfaces, cf. Ex- 
ercise 2.4.7. Its degree is (Exercise 3.1.1) 



deg(/) = [{±I}r 2 : {±7}A] 



[r 2 :A]/2 if -7 e T 2 and -/ ^ A, 
[A : A] otherwise. 



Recall from Figure 2.7 the local structure on the Riemann surfaces X(A) for 
j = l, 2. The commutative diagram 
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shows that /i oca i o tp 1 = tp 2 , where ipj is the composition pj o 5. If U is a 
neighborhood of r £ H then pi(z) = z hl and /9 2 (z) = -z 712 so the local map is 
q i Y q h z/ h i. Let 7j ;T denote the fixing subgroup of r in for j = 1,2. Then 
the periods /ij = |{±/}r^ T |/2 lie in {1,2,3} and since their ratio h 2 /hi is 
integral this implies hi = 1 or hi = h 2 . So the ramification degree is 



-7Ti(t) 



/i 2 //ii 



if t is an elliptic point for r 2 but not for 
otherwise 



= [{±/}r 2 , r : {±/}A, T ]. 



Since the periods hj are defined on the curves Xj, the ramification degree is 

indeed defined on Xi. If U is a neighborhood of s e Q U {00} then pi(z) = 
e 27r»*/hi and p2 (^ = e 27r lz // l2 g0 the local map ig ? ^ g /ti/h 2 _ Hcrc the w idths 

are hj = [SL 2 (Z)oo : {±/}_r^ s ] for j = 1,2 and the ramification degree is the 
reciprocal of their ratio, 



3 7Tl(s) 



/n//i 2 = [{±/}r 2;S : {±/}A, s ]. 



Again this is well defined on X x . If I\ is normal in _T 2 then all points of X{T\) 
lying over a given point of X(r 2 ) have the same ramification degree (Exer- 
cise 3.1.2). 

To compute the genus of X(r), further specialize to A = f and F 2 = 
SL 2 (Z). Let y 2 = SL 2 (Z)i, y 3 — SL 2 (Z)^ 3 , and = SL 2 (Z)oo be the elliptic 
point of period 2, the elliptic point of period 3, and the cusp of X(l) = 
SL 2 (Z)\"H*. Let e 2 and e 3 be the number of elliptic points of r in f~ 1 (y 2 ) 
and / _1 (j/3), i.e., the number of elliptic points of period 2 and 3 in X(r), and 
let Soo be the number of cusps of X(r). Then recalling that d = deg(/) and 
letting ft = 2 or h = 3, the formula for d at the beginning of the section and 
then the formula for e ffl ( T ) at the nonelliptic points and the elliptic points 
over SL 2 (Z)y/j show that (Exercise 3.1.3(a)) 

d= e x = h-{\f~ 1 {y h )\-e h ) + l-e h , 

and using these equalities twice gives 

]T («* -!) = (/*- l)(ir 1 (l»OI - e h ) = ^(d e h ). 

xef-Hvh) 
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Also, 

{e x -l)=d-s 00 . 

Since X(l) has genus 0, the Riemann-Hurwitz formula now shows 

Theorem 3.1.1. Let r be a congruence subgroup o/SL2(Z). Let f : X(r) — > 
X{1) be natural projection, and let d denote its degree. Let e 2 and £3 denote 
the number of elliptic points of period 2 and 3 in X{r), and the number 
of cusps ofX(r). Then the genus of X(r) is 

1 , d £2 £3 £00 

Proof. Exercise 3.1.3(b). □ 

The next result will be used in Sections 3.5 and 3.6. 
Corollary 3.1.2. Let r, g, £ 2 , £3, and be as above. Then 

25 - 2 + y + ^ + £co > 0. 

Proof. Exercise 3.1.3(c). □ 

Exercises 3.1.4, 3.1.5, and 3.1.6 compute the genus of X(r) for r = -To(p), 
r = A(p), and r = r(p) where p is prime. The results for general N will 
take considerably more work. 

A fundamental domain for Io(13) is shown in Figure 3.1, including repre- 
sentatives of the two elliptic points of order 2 (the light dots) and the two of 
order 3 (the dark ones — Exercise 3.1.4(f) explains how the figure was gener- 
ated and justifies the description to follow in this paragraph and the next). 
Each boundary arc with an order-2 elliptic point folds together at the elliptic 
point, identifying its two halves with each other, and each pair of boundary 
arcs joined at an order-3 elliptic point folds together at the elliptic point, iden- 
tifying with each other. The remaining two curved boundary arcs in the left 
half that don't include the cusp 0 are identified with each other, and similarly 
in the right half. The vertical boundary segments in the left half identify with 
those in the right half, and so the remaining two arcs are identified with each 
other as well. Gluing boundary arcs together under these identifications gives 
a sphere per the formula in Exercise 3.1.4(e). 

The 13 points of SL 2 (Z)(i) in the figure identify to eight points of A 0 (13), 
and the 14 points of SL 2 (Z)(^ 3 ) in the figure identify to six points of A 0 (13). 
The boundary arc for each order-2 elliptic point x folds together like the local 
coordinate about i in X(l) so that the ramification degree of the projection 
/ : Xo(13) — > X(l) at these points is e x = 1, while e x = 2 at the other six 
points x that project to SL 2 (Z)(i) and the ramification degrees sum to 14 = 
[SL 2 (Z) : (13)] as they should. Each pair of boundary arcs joined at an 
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Figure 3.1. Fundamental domain for To (13) 

order-3 elliptic point x folds together so that e x — 1, while e x — 3 at the 
other four points x that project to SLi2(Z)(/i3) and again the sum is 14. The 
ramification degrees at the cusps Jo(p)(oo) and 7o(13)(0) are visibly 1 and 13. 

To generate pictures of fundamental domains, see Helena Verrill's web page 
[Ver]. 

Exercises 

3.1.1. Show that natural projection / : A(J\) — > A(_T 2 ) has the degree 
claimed in the text. (A hint for this exercise is at the end of the book.) 

3.1.2. Show that if I\ is normal in _T 2 then all points of A(/\) lying over 
a given point of X(r 2 ) have the same ramification degree. (A hint for this 
exercise is at the end of the book.) 

3.1.3. (a) Fill in details as necessary in the computations preceding Theo- 
rem 3.1.1. 

(b) Prove Theorem 3.1.1. 

(c) Prove Corollary 3.1.2. 
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3.1.4. Let p be prime, let X 0 (p) = r 0 (p)\H* , let a 3 ■ = [)\] for j = 0, . . . ,p- 
1, and let — [ \ 

(a) Show that SL 2 (Z) = (J . rb(p)ct,-, a disjoint union. 

(b) Show that X 0 (p) has exactly two cusps. (Hints for this exercise are at 
the end of the book.) 

(c) Show that jotj(i) = aj(i) for some 7 € r 0 (p) of order 4 if and only if 
j 2 + 1=0 (mod p). Thus the number of elliptic points of period 2 in X 0 (p) is 
the number of solutions of the congruence x 2 + 1 = 0 (mod p). This number 
is 2 if p = 1 (mod 4), 0 if p = 3 (mod 4), and 1 if p = 2. 

(d) Show that jctj (^3) = ctj(^) for some 7 € A(p) of order 6 if and only 
if — j + 1 = 0 (mod p). Thus the number of elliptic points of period 3 in 
Xq (p) is the number of solutions of x 2 — x + 1 = 0 (mod p) . This number is 
2 if p = 1 (mod 3), 0 if p = 2 (mod 3), and 1 if p = 3. Along with part (c), 
this shows that the number of elliptic points is determined by p (mod 12). 
The example p = 13 in the text is the smallest where all four possible elliptic 
points exist. 

(e) Let g be the genus of X 0 (p) and let k — p+1. Show that 




Lt|J-1 if k = 2 (mod 12), 
L^J otherwise. 



(The symbol k is introduced here because the same formula with k as the 
weight will recur later in the chapter.) 

(f) Figure 3.1 was generated using the coset representatives 

&=[}?][?-£]> i = "6,...,6, 

and ,000 = [J "1] a oo [1 ~o]- Find each translate !3j{V) in the figure, where 
T> is the fundamental domain from Chapter 2. Show that the 13 points 
of SL 2 (Z)(z) in the figure are for j = —6,..., 6. Since [j - 0 ] fixes i, 

the elliptic points of order 2 are flj(i) when j 2 + 1 = 0 (mod 13). Show that 
jf3j(i) — (3j'(i) for some 7 e -To (13) of order 4 if and only if jf + 1 = 
0 (mod 13). Use this to partition the 13 points of SL 2 (Z)(z) in the figure into 
eight equivalence classes under Jo (13), five with two points each where the an- 
gle is 7r, giving a total of 2tt; one with one point where the angle is 2ir; and two 
with one point where the angle is ir as it is at i in T>, representing the unrami- 
fied points. Identify the boundary arcs pairwise except for two arcs that fold in 
on themselves. Note that SL 2 (Z)(/X3) = SL 2 (Z)(/ig). Since [j ~J] takes /j.^ to 
fis, the elliptic points of order 3 are /3j(/j,e) when j 2 — .7 + 1 = 0 (mod 13). Show 
that the 14 points of SL 2 (Z)(^ 3 ) in the figure are (3j(ne) for j = —6, . . . , 7. 
Show that 7/3j(/i6) = ftj'ine) for some 7 <G Io(13) of order 3 or 6 if and only 
if jj' — .7 + 1 = 0 (mod 13) or jf — j' + 1 = 0 (mod 13). Use this to partition 
the 14 points of SL 2 (Z)( / U3) in the figure into six equivalence classes under 
Jo (13), one with the four points where two of the angles are 27r/3 and two of 
the angles are 7r/3, giving a total of 2ir; two with with three points where the 
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angle is 2n/3, again giving 2ir; one with the two points where the angle is n, 
giving 2ir; and two classes with one point each where the angle is 27r/3 as it is 
at /i3 in T>, representing the unramified points. Determine whether each pair 
of boundary arcs is identified with orientation preserved or reversed. Show 
that under identification the figure is topologically a sphere. 

(g) Carry out the same process for X 0 (ll) and Xq(17), convincing yourself 
that they are tori in accordance with part (e). 

3.1.5. Let p be prime and let X^p) = A(p)\?T. 

(a) Show that if 7 € SL 2 (Z) S where s e Q U {00} (and as in Chapter 2 the 
group is its isotropy subgroup) then 7 has trace ±2. Show that if p is a prime 
then r 0 (p) s = {±I}ri(p) s . Deduce that natural projection / : X\(p) — > 
Xq(p) is unramified at the cusps. Combine this with part (b) of the previous 
problem to show that X\{p) has p — 1 cusps if p > 2 and 2 cusps if p = 2. 

(b) Exercise 2.3.7 showed that Xi(p) has no elliptic points if p > 3, that 
Xi(3) has no elliptic points of period 2 and one elliptic point of period 3, and 
that X\ (2) has one elliptic point of period 2 and no elliptic points of period 3. 
Combine this information with part (a) to compute the genus of X\ (p) . (A 
hint for this exercise is at the end of the book.) 

3.1.6. Let p be prime and let X(p) = r(p)\H* ■ Compute the genus of X(p). 
(A hint for this exercise is at the end of the book.) 



3.2 Automorphic forms 

This section defines automorphic forms, functions on the upper half plane T-L 
similar to modular forms but meromorphic rather than holomorphic. Auto- 
morphic forms of positive even weight k with respect to a congruence sub- 
group r are closely related to meromorphic differentials of degree k/2 on the 
quotient Riemann surface X (r), to be described in the next section. Sec- 
tions 3.5 and 3.6 will exploit the relation to compute dimension formulas for 
spaces of modular forms. 

Let C denote the Riemann sphere CU{oo}. Recall that for an open subset 
V C C, a function / : V — > C is meromorphic if it is the zero function or 
it has a Laurent series expansion truncated from the left about each point 

t e v, 

00 

f(t) = a n{t - T) n for all t in some disk about r, 

n—m 

with coefficients a n e C and a m ^ 0. The starting index m is the order of f 
at t, short for "order of vanishing" and denoted v T (f); the zero function is 
defined to have order v T (f) = 00. The function / is holomorphic at r when 
v r{f) > 0, it vanishes at r when v T (f) > 0, and it has a pole at r when 
v T (f) < 0. The set of meromorphic functions on V forms a field. 

Let r be a congruence subgroup of SL 2 (Z). Recall the weight-fc operator, 
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(fh]k)(r) = , ? (7, r)- fe /(7(r)), j(% r) = cr + d for 7 = [ . 

As in Chapter 1 , a meromorphic function / : H — > C is called weakly modular 
of weight k with respect to r if f["f]k = f for all 7 e _T. To discuss meromorphy 
of / at 00, let h be the smallest positive integer such that [J \ ] e r. Thus / 
has period h. If also / has no poles in some region {r e "H : Im(r) > c} then 
/ has a Laurent series on the corresponding punctured disk about 0, 

00 

/(r)= £ a„g£ if Im(r) > c, where q h = e™ T ' h . 

n— — 00 

Then / is meromorphic at 00 if this series truncates from the left, starting at 
some to € Z where a m 7^ 0, or if / = 0. The order of f at 00, denoted Uoo(f), 
is again defined as the starting index to except when / = 0, in which case 
v oo{f) = 00 • Now automorphic forms are defined the same way as modular 
forms except with meromorphy in place of holomorphy. 

Definition 3.2.1. Let r be a congruence subgroup o/SL2(Z) and let k be an 
integer. A function f : % — > C is an automorphic form of weight k 
with respect to r if 

(1) / is meromorphic, 

(2) / is weight-k invariant under r , 

(3) f[a]k is meromorphic at 00 for all a e SL 2 (Z). 

The set of automorphic forms of weight k with respect to r is denoted Ak{r) . 

Condition (3) makes sense since f[a]k is meromorphic on T-L and weakly 
modular with respect to a^ 1 Ta, a congruence subgroup of SL 2 (Z). For any 
cusp s e Q U {00}, the order of / at s is defined as 

v s {f) = Voo{f[ot]k) where a(oo) = s, a e SL 2 (Z). 

This is independent of the choice of a and well defined on the quotient X(r) 
(Exercise 3.2.1). Condition (3) implies beyond Condition (1) that the poles 
of / in W cannot cluster at any s e Q U {00}. 

Setting k = 0 in the definition shows that >4o(-0 is the field of meromorphic 
functions on X(r), denoted C(X(r)). At elliptic points and cusps, where the 
local coordinate is many-to-one, functions / € Ao(r) are correspondingly 
periodic and thus meromorphic in the local variable of the Ricmann surface 
structure. This statement will be elaborated later in the section. 

For example, recall the modular invariant, defined as 



3.2 Automorphic forms 73 



where #2 and A are the weight 4 Eisenstein series and the weight 12 cusp 
form from Chapter 1 . Since the numerator and denominator are both wcight- 
12 invariants under SL 2 (Z) and holomorphic at 00, the quotient is SL 2 (Z)- 
invariant and meromorphic at 00. Thus j is an automorphic form of weight 0 
with respect to the full modular group SL 2 (Z) and is naturally viewed as 
a meromorphic function on the quotient, j : X(l) — > C. Since A is a cusp 
form, j has a pole at 00. By Corollary 1.4.2, stating that A is nonzero at every 
point of W, this is its only pole. The leading 1728 normalizes its Laurent scries 
to (Exercise 3.2.2) 

fa) = " + E a «9"< »»eZ,? = e 2 — . (3.1) 

Since its one pole is simple, j has degree 1 as a map j : X(\) — > C of 
compact Riemann surfaces, cf. the beginning of Section 3.1. It follows that j is 
a homcomorphism and conformal since it is locally analytic with nonvanishing 
derivative, and it takes Y(l) to C and 00 to 00. In fact 

A(SL 2 (Z)) = C(j), 

that is, the field of meromorphic functions on X(l) is the set of rational 
functions of j. The containment C(j) C >4o(SL 2 (Z)) is clear. For the other 
containment, let / be meromorphic and nonconstant on X(l) with finite zeros 
Z\ , . . . , z m and finite poles pi , . . . , p n , listed with multiplicity. Consider a 
function g £ C(j), 

Then g has the same zeros and poles as / away from SL 2 (Z)oo, so g must 
vanish to the same order as / at SL 2 (Z)oo as well since for both functions the 
total number of zeros minus poles is 0. Thus f/g : X(\) — > C has no zeros 
or poles, making it constant and showing that / € C(j). 
Similarly, Chapter 7 will show that 

A 0 (r 0 (N)) - C(j,j n ), where j n (t) = j(Nr). 

Chapter 7 will also describe A Q (r 0 (N)), _4 0 (A(AO), and -A 0 (r(N)) using the 
functions /o, /q , and /q from the end of Section 1.5. 

Returning to the general discussion, the sum A(r) = Ak(r) forms 

a ring. The derivative j' lies in _4 2 (.T), and it follows that Ak(r) contains 
nonzero elements for all even positive integers k (Exercise 3.2.3). For any 
k e Z, if / is a nonzero element of A k (r) then A k (r) = C(X(r))f = 
{fof ■ fa € C(X(r))} (Exercise 3.2.4). These facts will be used to compute 
dimension formulas later in the chapter. 

The transformation law /(7(t)) = j(7j T ) k f( T ) shows that an automorphic 
form / with respect to a congruence subgroup _T is generally not well defined 
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on the quotient X(r). (The exceptional cases are when k = 0 or / = 0.) 
However, the order of vanishing of any automorphic form / does turn out to 
make sense on the quotient. Defining this notion appropriately requires taking 
the Riemann surface structure of X(r) into account. 

Let 7t(t) <G X(r) be a noncusp. Since the factor of automorphy j(-f, r) has 
no zeros or poles on "H, the order of / is the same at all points of -T(r). The 
local coordinate at 7t(t) is essentially q — (t — t) h where t has period h as 
an elliptic point of r. Excluding the zero function, since f(t) = a m (t — r) m + 
• • • = a m q m / h + ■ ■ ■ with a m ^ 0 the natural definition of the order of / at 
7t(t) e X(r) in terms of the order to of / at r <E W is m/h, that is 

IS ( /") 

v ir(T){f) = — where r has period h. (3.2) 

Thus Vk(t){J) = v r{f) except at the elliptic points, where v v ^{f) can be 
half-integral or third-integral. As remarked earlier, if k — 0 then h divides to 
and i^r( T )(/) is integral. 

Defining the order of / at cusps tt(s) £ X(r) is trickier. First consider 
the cusp 7r(co). Its local coordinate is = ^ lT l h where h is the width, 
characterized by the conditions 

{±I}r 00 = {±I}([l1]), heZ+. 

But since the negative identity matrix — / might not lie in r this implies only 

r oo = {±7}([^]) or ^ = {[5 J]) or ^ = (-[1^]). 

Thus the width h here is not necessarily the same as the period h from the 
beginning of the section. In all three cases the orbits Ft and F{t + h) for 
t e % are equal. In the first two cases, / has period h compatibly with this. 
But in the third case, which can arise only when —I ^ r (see Exercise 3.2.5 
for an example), the factor of automorphy j{— [J \ \ ,r) = — 1 shows that if k 
is even then again / has period h, while if k is odd then /(r + h) = — f (r) 
so that / has skew-period h but period 2h. In all cases set b! = 2h, so h' 
is twice the width and usually twice the period, and set qy — e 27Tl ^ h . Then 
/(t) = g(qh') for Im(r) > c where q h > = e 27TlT / h and c is large enough to 
avoid the finite poles of /, and g has Laurent expansion 

oo 

g(ih') = anq h' ' a ™ ^ °- 

n—ra 

1 /2 

Since q^ — q h is the square root of the local variable, the order of / at 7r(oo) 
is naturally defined as to/2, half the lowest power of q^, except when / = 0 
in which case ^ 7r ( 00 )(/) = oo. The relation /(r + h) = g(-qh') shows that 
when / in fact has period h, the function g is even and ^7r(oo)(/) is the integer 
Vco{f) from before. In the exceptional case where = (— [J ^]) and k is 
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odd, the skew periodicity f(r + h) = —f(r) shows that g is odd and ^ 7r ( 00 )(/) 
is ^oo(/)/2, possibly a half-integer. 

For a general cusp ir(s) G X(r) where s e Q U {oo}, take any matrix 
a e SL 2 (Z) such that a(oo) = s. Using a to conjugate the discussion above 
from oo to s gives the order of / at ir(s) <G X(r) in terms of the order of / 
at s e Q U {oo}, 

(/) = l^ (/)/2 ^ ( a " lra )°° = ^[o?]) and fc is odd, 
w ^ ' otherwise. 

This can be half-integral in the exceptional case, when ir(s) or s itself is called 
an irregular cusp of T. For example, when fc is odd 1 /2 is an irregular cusp 
of A (4) by Exercise 3.2.5. Section 3.8 will show that this is the only example 
of an irregular cusp for the groups r 0 (N), i~i (AT), and r(N). Again if fc = 0 
then tV( s )(/) is integral. 

The results so far in this chapter lead to more examples of modular forms. 
Recall the Dedekind function j](t) = Q24 Il^i (1 — <?") anc ^ the discriminant 
function A{t) = (2ir) 12 V (T) 2i = (2ir) 12 q fl~ =1 (l - g™) 24 from Chapter 1, 
where as always r € H and g = e 2lTlT , and g 24 = e 27 " T / 24 . 

Proposition 3.2.2. Le£ fc and A~ be positive integers such that k(N + l) = 24. 
Thus (k,N) is one of (1,23), (2,11), (3,7), (4,5), (6,3), (8,2), or (12,1). 
Define a function 

(a) // <Sfe(/i(iV)) is nonzero then <Sfe(ri(iV)) = Ctpfe. 

(b) IfS k (r 0 (N)) is nonzero then S k (r 0 (N)) = S^r^N)) = Op k . 

Proof. Consider the function <? = </?^ +1 , i.e., 

00 

g(r) = (2n)- 2i A(t)A(Nt) = q N +' g") 24 (l - q Nn ) 2 \ 

71=1 

Since A(t) e Si 2 (SL 2 (Z)) it follows that A(Nt) e S 12 (r 0 (N)) and so g e 
<S 24 (r 0 (AO) C 5 24 (A(iV)). Let 5 = [? "J] and g w = e^W^ an d compute 
(Exercise 3.2.6) 

OO 

(5[S] M )(r) = AT^r/iV) = N^q^ 1 g^) 24 (l - g£") 24 . 

n=l 

Let 7To : %* — > Xq(N) and 7Ti : "H* — >• X\(N) be the natural projections, 
e.g., 7r 0 (r) = r 0 (iV)(r) for t e H* . All cusps of AW and A (AT) are regular 
since N ^ 4. If AT = 1 then the only cusp of X n (N) is 7r 0 (oo), of width 1. 
Otherwise A~ is prime and Exercise 3.1.4 shows that the cusps of Xq(N) are 
7To(oo), of width 1, and 7r 0 (0), of width N. Exercise 3.1.5 shows that the 
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natural projection tt : X\(N) — > X 0 (N) is unramificd at the cusps when TV 
is prime, and the same assertion holds trivially when TV = 1. Thus if tvi(s) is a 
cusp of Xi(N) over 7r 0 (oo) then it has width 1, s = a(oo) for some a £ r 0 (N), 
and the identity g[a]24 = 9 combines with the displayed product form of g to 
show that ^ 7ri ( s )(.g) = N + 1 by (3.3) since the cusp is regular, i.e., g has a 
zero of order N + 1 at ni(s). Similarly, if tti(s) is a cusp over 7To(0) then it 
has width TV + 1, s = a(0) = aS(oo) for some a £ r 0 (N), and the identity 
(/[aS*] 24 = g [S] 24 combines with the displayed product form of g [S] 24 to show 
that also i' 7ri ( s )(g) = N + 1 and again g has a zero of order N + 1 at wi(s). 

Suppose / e Sfc(Ji(i\0). Then / Ar+1 e S 24 (A(-/V)). The quotient Z^+Vff 
lies in _4 0 (A(TV)) = C(Xi(TV)), it is holomorphic on Yi(N) because A is 
nonzero on H, and it is holomorphic at the cusps of Xi(N) since f N+1 has 
zeros of order at least TV + 1 at the cusps. The only holomorphic functions 
on the compact Riemann surface Xi(N) are the constants, so f N+1 = eg for 
some c £ C. Taking (TV + l)st roots gives /(r) = C/U^y ^fc(r) for each r£H 
where /x/v = e 27Tl / N and e(r) is a continuous integer-valued function of r, 
making it constant, and c denotes different constants in different places. That 
is, / = cipk, proving part (a). Part (b) follows immediately since Sk(r 0 (N)) C 
5 fe (A(TV)). □ 

The proposition does not show that (fk is a modular form at all until 
we know by some other means that a nonzero modular form of the desired 
type exists; the only possible / in the proof could be / = 0, in which case 
the relation / = cipk does not imply ip^ = cf. When (fc, TV) = (12,1) the 
proposition shows that 6>i2(SL 2 (Z)) = CA since A is a nonzero cusp form of 
weight 12. The dimension-counting formulas to be obtained in this chapter 
will show that 5fe(Jo(TV)) is 1-dimensional when (fc, TV) is any of (2, 11), (4, 5), 
(6,3), (8,2), so part (b) of the proposition applies in these cases. Although 
iSfe(-To(TV)) = {0} when k is odd since —I E Jo (TV), the dimension formulas will 
also show that S3 (A (7)) is 1-dimensional, and part (a) applies. The formulas 
will not help with the remaining case Si (A (23)), but here one can still verify 
directly that tpi lies in the space (though we omit this) and so the proposition 
says that it spans the space. Thus ipk spans Sfc(A(-W)) whenever fc(TV+ 1) = 
24, and it also spans Sk(r 0 (N)) when k is even. 

Exercises 

3.2.1. Let / e Ak(r) be an automorphic form and let s £ Q U {00} be a 
cusp. Show that the definition of v s {f) given in the text is independent of the 
choice of a and well defined on the quotient X(r). 

3.2.2. Show that the modular invariant j has Laurent series (3.1). (A hint for 
this exercise is at the end of the book.) 

3.2.3. Let TT be a congruence subgroup of SLi2(Z). Show that the derivative of 
the modular invariant j lies in ^(A- Show that consequently Ak(r) contains 
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nonzero elements for all even positive integers k. (A hint for this exercise is 
at the end of the book.) 

3.2.4. Show that for any k € Z, if f is a nonzero element of Ak(r) then 

Mn = c(x(r))f. 

3.2.5. Let a = and let T = a^T^a. Show that r x = (-[J J]). (A 
hint for this exercise is at the end of the book.) 

3.2.6. Verify the expression for g[S']24 in the proof of Proposition 3.2.2. 
3.3 Meromorphic differentials 

Let r be a congruence subgroup of SL 2 (Z). The transformation rule for au- 
tomorphic forms of weight k with respect to _T, 

does not make such forms /^-invariant. On the other hand, Section 1.1 observed 
that d"f{r) = j(-f 7 r)~ 2 dT for 7 e r so that at least formally the differential 

/(r)(dr) fe / 2 

is truly _T- invariant. This section makes sense of such differentials on the 
Ricmann surface X(r). Doing so requires care at the elliptic points and the 
cusps. 

The first step is to define differentials locally. Let V be any open subset 
of C and let n e N be any natural number. The meromorphic differentials 
on V of degree n are 

ft® n (V) = {.f(q)(dq) n : f is meromorphic on V} 

where q is the variable on V. These form a vector space over C under the nat- 
ural definitions of addition and scalar multiplication, f(q)(dq) n + g(q)(dq) n = 
(/ + g)(<l)(dq) n and c(f(q)(dq) n ) = (cf)(q)(dq) n . The sum over all degrees, 

Q(V) = 0 Q® n (V), 

neN 

naturally forms a ring under the definition (dq) n (dq) m = (dq) n+m . 

Since a Ricmann surface involves not only local patches but also transition 
maps between them, we next need to study maps between local differentials. 
Any holomorphic map ip : V\ — > V2 between open sets in C induces the 
pullback map in the other direction between meromorphic differentials, 

if* : Q® n (V 2 ) — > tt®"(Vi), 
P*(/(<fc)(d<a) n ) - f(v(qi)W(qi)nd qi ) n . 
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In other words, taking the pullback means changing variables in the meromor- 
phic differential, doing so compatibly with the change of variable formula for 
integrals. The pullback is obviously linear. The pullback is also contravariant, 
meaning that if f\ = f is as before and p 2 ■ V 2 — > V3 is also holomorphic 
then (ip 2 0 f\)* — ft 0 f 2 (Exercise 3.3.1(a)). If V\ C V 2 and l : V\ — > V 2 is 
inclusion then its pullback is restriction, l*(uj) = u\ Vl for w G f}® n {V 2 ) (Ex- 
ercise 3.3.1(b)). It follows that if f is a bijection, making its inverse holomor- 
phic as well by complex analysis, then (ip*)^ 1 = (f^ 1 )* (Exercise 3.3.1(c)). 
If 7T : V\ — > V 2 is a holomorphic surjcction of open sets in C then n* is an 
injection (Exercise 3.3.1(d)). 

Now we can piece together local differentials on a Riemann surface A. Let 
A have coordinate charts fj : Uj — > Vj, where j runs through some index 
set J, each Uj is a neighborhood in X, and each Vj is an open set in C. A 
meromorphic differential on X of degree n is a collection of local meromorphic 
differentials of degree n, 

jeJ 

that is compatible. To define this, let Vj^k — fj(Uj n Uk) and Vkj = fk(Uj n 
Uk) for j,k e J. Then the compatibility criterion is that pulling back any 
transition map 

V h k ' J > Vkj 

to get the corresponding map of local meromorphic differentials 
gives 

fi.ji^kWu,,) = uj\ Vj , k - 

The meromorphic differentials of degree nonX are denoted i?® n (X). Again 
this set forms a complex vector space and the sum Q{X) = (-R {2® n (X) 
forms a ring. 

For example, when A is a complex elliptic curve C/ 'A the coordinate maps 
fj : Uj — > Vj are homeomorphic local inverses to natural projection ir : 
C — > X. Let w = (dzj)j e j. Then u satisfies the compatibility criterion since 
the transition maps between coordinate patches take the form z n- z + X for 
A e A, and these pull dz k back to dzj. Thus the differential dz makes sense 
globally on a complex torus even though a holomorphic variable z does not. 

Returning to the congruence subgroup _T of SL 2 (Z), we now map each 
meromorphic differential uj on X(r) to a meromorphic differential /(t)(c?t)™ 
on %, the _T-invariant object on % mentioned at the beginning of the section. 
The map is the pullback of the natural projection 7r : % — > X(r), 
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ir* ■. rt® n (jr(r)) — -> s?® n (H). 

To define this, recall from Chapter 2 that a collection of coordinate neigh- 
borhoods on X(r) is {^(Uj) : j <G J} where each Uj C H* is a neighbor- 
hood of a point -n,- e "H or of a cusp Sj £ QU {°°}; the local coordinate 
map ifj : Tt(Uj) — > Vj is characterized by the relation ipj — ipj o ir where 
ipj : Uj — > Vj mimics the identifying action of tt but maps into C. Now 
let uj = (uij)j(zj be a meromorphic differential on X(r). For each j € J set 
U- = Uj H H and V- = ipj{U'j) and ujj = ujj\ v >. Then the pullback tt*{uj) is 
defined locally on % as 

Tr'MIt,/ = ^(u/j) for all (3.4) 

To see that (3.4) gives a well defined global meromorphic differential tt*(uj) = 
/(r)(dr)™ on H, consider the commutative diagram 



u, n u k 



(3.5) 



^^^^{U^U k )—fV k 



The transition map ipk,j = <£Wj Iv, k satisfies ipkj °ipj — ipk on Uj n and 
pulling back gives ^ = il>jOip* kj on V" fe 3 . Therefore, letting Vj k = ^j(U'j^U' k ) 
<mdV k \j=MU^U' k ), 

iPU^klv^) = ^(^fcjMv^.)) = ^(Wjlv/J by compatibility, 

so the overlapping local pullbacks have a common value f(T)(dT) n \u'.nu' an d 

j k 

the global pullback tt*(lu) = /(r)(dr)™ is well defined. 

Since the pullback comes from an object on the quotient X(r) it must be 
r-invariant. That is, for any j £ T, 

/(r)(dr) n = 7*(/(r)(dr)") = (/(7W))(V(r)) n (dr) n 

= .7(7,r)- 2 "/( 7 (r))(dr)" = (/[ 7 ] 2 „)(r)(dr)". 

Thus the meromorphic function / defining the pullback is weakly modular of 
weight 2n. This is the motivation promised back in Chapter 1 for the definition 
of weak modularity, at least for even weights. 

The function / also satisfies condition (3) in Definition 3.2.1, that /[a]2n 
is meromorphic at oo for any a G SL 2 (Z). To see this, let s = a(oo). The 
local map ip : U — ► V about s takes the form i/) = poj : t H z ^ g with 
S = oT x and p(z) — e 27r W l where h is the width of s. Since uj is meromorphic 
at the cusps of X(r), the local differential uj\ v takes the form g(q)(dq) n 
where g is meromorphic at 0. The function / on U — {s} comes from pulling 



80 3 Dimension Formulas 



back w|y_{ 0 } under ip to f(r)(dT) n . Computing this gives / = f[6]2 n where 
(Exercise 3.3.2(a)) 

f(z) = (2m/h) n q n g(q), q = e 2 ™' h . 

Thus (f[a]2n)(z) = f(z) is meromorphic at oo as claimed. In sum, every 
meromorphic differential lo of degree n on X(r) pulls back to a meromor- 
phic differential 7r*(o>) = /(r)(dr) n on H where / is an automorphic form of 
weight 2n with respect to r. 

The calculation that tt*{lu) is well defined can be turned around (Exer- 
cise 3.3.2(b)) — given a collection (u>j) € YljeJ M® n (Vj) of local meromorphic 
differentials, exclude the cusps by setting Uj = Uj C\% and V- — ipj(Uj) and 
u)j = 0Jj\vj for all j € J- If the u)j pull back under -0* to restrictions of some 
meromorphic differential /(r)(dr) n on "H then the original local differentials 
cjj are compatible and so (o^-) € i7® n (X(7 n )). Thus, a given collection (wj) of 
local meromorphic differentials is compatible, giving a meromorphic differen- 
tial on X(r), if and only if its local elements pull back to restrictions of some 
/(r)(rfr)™ G Q® n (H) with / e A 2n (r). 

Conversely, given an automorphic form / £ A.2n(r) we will construct a 
meromorphic differential u(f) <= /2 l8n (X(r)) such that tt*(w) = /(r)(dr) n . 
By the previous paragraph it suffices to construct local differentials that pull 
back to restrictions of /(r)(dr) n . Thus the idea is to express /(r)(dr) n in 
local coordinates. 

Each local map : — ^ is a composite — pj o Sj : t h- > z h- > q 
with (5j G GL2(C). Since Sj is invertible it is easy to transform /(r)(dT) n 
into z-coordinates locally. First extend the weight- A: operator [7]*. to matrices 
7 G GL 2 (C) by defining j(~f, r) = cr + d as before and 

(/[7] fc )(r) = (det 7 ) fe/2 j(7,r)- fc /(7(r)) 

when this makes sense. The exponent k/2 of det7 is a normalization conve- 
nient for the next calculation because of the formula 7'(r) = det7/j(7,T) 2 
(Exercise 3.3.3); in other contexts the factor (det7)' c ~ 1 is more convenient. 
Now let U'j = U 3 n H. Then /(r)(dr) n |[7j is a pullback <5*(Aj) where Aj is the 

differential obtained by pulling /(r)(dr)™|yj forward to z-space under SJ 1 . 
Letting a = Sj 1 , 

A, = a*(/(r)(dr)»|^) - /(a(z))(d(a(z)))" 

= (deta)"i(a,^)- 2 "/(aW)(d0)" - (/[a] 2 „)(z)(dz)". 

Thus Aj is SjTSj -inva,naxit since /(r)(dr)" is ^-invariant. Further pushing 
Aj forward from z-space to g-space isn't quite as easy since pj is not invertible 
in general, but it isn't hard to find local forms ujj that pull back under pj 
to A, as desired. 
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Specifically, if Uj ; C H, i.e., if we are not working at a cusp, then Sj takes 
Tj to 0 and the quotient {±I}(5jrSj 1 ) a /{±I} is cyclic of order h, gener- 
ated by the rotation r^ : z i-> p^z where p,h = e 27!l / h . By JjPc^-invariance 
the pullback r* h {Xj) = (/[a] 2n )(^hz)^(cb) n must equal Xj, or equivalently 
K(/H2„)(W^) = (f[a] 2n )(z), or (Mfc-2) n (/[a]2n)(Mh^) = z n (f[a] 2n )(z). In 
other words, the function z n (f[a]2n)(z) takes the form gj(z h ) for some mero- 
morphic function gj. Note for later reference in proving formula (3.8) that 
hvo(gj) = n + v Tj {f) an d thus vo{gj) — tV(r.,)(/) + n /h. Define a local mero- 
morphic differential in ^-coordinates, 



9M) 
(hq) n ' 



<"i = TT^W onV 3 . (3.6) 



Since Pj(z) — z h , this ujj pulls back under pj to Xj (Exercise 3.3.4(a)), which 
in turn pulls back under Sj to the original /(r)((ir) n |c/ j .. Thus, for each Uj C W 
the differential uij on Vj pulls back under ipj to a suitable restriction of the 
global differential /(r)(<ir)™ on H. 

On the other hand, if Uj contains a cusp Sj then <5j takes Sj to oo and the 
function {f[a\2n){z) takes the form gj(e 2 ' Klz ^ h ) where gj is meromorphic at 0 
and h is the width of s. The relevant local differential is now 

which is meromorphic at q = 0. Since q = Pj(z) = e 27Tlz / h , again ujj pulls back 
under pj to Aj (Exercise 3.3.4(b)), and as before it follows that Tp*{ujj) = 
/(r)(dr)"| l /j . Putting all of this together gives 

Theorem 3.3.1. Let k G N be even and let r be a congruence subgroup 
o/SL 2 (Z). The map 

uj ■. A k (r) — > n m / 2 (x(r)) 

f i y (Uj) je j where (uj) pulls back to f{r){dT) k ' 2 G Q m ' 2 {U) 
is an isomorphism of complex vector spaces. 

Proof. The map ui is defined since we have just constructed oj(f). Clearly to is 
C-linear and injective. And uj is surjective because every (ujj) G Q® k / 2 (X(r)) 
pulls back to some /(r)(rfr) fe / 2 G Q k l 2 (U) with / G AGO- □ 

Exercises 3.2.3 and 3.2.4 showed that for k positive and even, Ak(r) takes 
the form C(X(r))f where C(X(r)) is the field of meromorphic functions 
on X(r) and / is any nonzero element of Ak(r). Thus, Theorem 3.3.1 shows 
that i?® fe / 2 (X(r)) = C(X(r))uj(f) for such k. 

The aim of this chapter is to compute the dimensions of the subspaces 
M k (r) and S k (r) of A k (r). Now that we know that A k (r) and ft® k / 2 {X{r)) 
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are isomorphic, the final business of this section is to describe the images 
ui{Mk{r)) and uj(S k (r)) in 0® k / 2 (X(r)). Some Riemann surface theory to 
be presented in the next section will then find the desired dimensions by 
computing the dimensions of these image subspaces instead in Sections 3.5 
and 3.6. 

So take any automorphic form / e Ak(r) and let co(f) — For a 

point Tj e H, the local differential (3.6) with n — k/2 vanishes at q = 0 to 
(integral) order (Exercise 3.3.5) 

^>*« (<^) -**»></>- 5 H)' (3.8) 

In particular, at a nonelliptic point, when h = 1, the order of vanishing is 
^o(cjj) = tV(r )(/)i the order of the original function. For a cusp Sj the local 
differential (3.7) with n = k/2 vanishes at q = 0 to order (Exercise 3.3.5 
again) 

When k G N is even, formulas (3.8) and (3.9) translate the conditions 
v -K(Tj){f) > 0 an d f / 7r (s j )(/) > 0 characterizing Aik(r) as a subspace of Au{r) 
into conditions characterizing w(A / Jfe(/ 1 )) as a subspace of (2® k / 2 (X(r)), and 
similarly for Sk(r) and u(Sk(r)). In particular, the weight 2 cusp forms <?2(-0 
are isomorphic as a complex vector space to the degree 1 holomorphic differ- 
entials on X(r), denoted f2^ ol (X(r)) (Exercise 3.3.6). This special case will 
figure prominently in the later chapters of the book. 

Exercises 

3.3.1. (a) Show that the pullback is contravariant. 

(b) Show that if i : V\ — > V2 is inclusion then its pullback is the restriction 
i*(u)) = lu\ Vi for lu e n® n (V 2 ). 

(c) Show that if ip is a holomorphic bijection of open sets in C then 
(p- 1 )* = (p*)" 1 . 

(d) Show that if w : V\ — > V2 is a holomorphic surjection of open sets 
in C then it* is an injection. If i : V\ — > V2 is an injection of open sets in C, 
need i* be a surjection? 

3.3.2. (a) Let V = P ° 5 where 5 e SL 2 (Z) and p(z) = e 2mz / h . Let 
lu = g(q)(dq) n . Show that = .f(r)(dr)™ with / = f[5] 2n and f(z) = 
(2iri/h) n q n g{q) where q = e 27Tiz/h . 

(b) Consider the situation it : H — > X(r) as described in the small 
in diagram (3.5). Show that if the local meromorphic differentials ujj pull 
back under ip* to restrictions of some common global meromorphic differential 
/(r)dr on H then they are compatible. Exercise 3.3.1(d) is relevant. 
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3.3.3. For 7 € GL 2 (C), show that i(r) = dct 7/^(7, r) 2 . (This generalizes 
Lemma 1.2.2(e).) 

3.3.4. (a) For a neighborhood ?7j C H, show that the local differential u>j 
in (3.6) pulls back under pj to Aj. How was uij found in the first place? 

(b) Same question for u)j in (3.7) when Uj contains a cusp. 

3.3.5. Prove formulas (3.8) and (3.9). 

3.3.6. Prove that <S 2 (-0 and f2^ ol (X(r)) are isomorphic as complex vector 
spaces. Your proof will incidentally show that the elements of ^(-T) vanish 
at the elliptic points of X(r). Argue this directly as well by examining the 
text leading up to (3.6). 

3.4 Divisors and the Riemann Roch Theorem 

Let X be any compact Riemann surface, though for our purposes X is nearly 
always X(r). A divisor on X is a finite formal sum of integer multiples of 
points of X, 

D = ^ n x x, n x <G Z for all x, n x = 0 for almost all x. 

The set Div(X) of divisors on X forms an Abclian group, the free Abelian 
group on the points of X, under the natural definition of addition, 

n x x + n' x x = ^(n x + n' x )x. 

(Since every divisor sums over all points x € X, there is no need to continue 
notating the set of summation.) For divisors D = J2 n x x an d D' = J2 n 'x x > 
write D > D' when n x > n' x for all x. The degree of a divisor is the integer 

deg(-D) = Y Ux - 

The map deg : Div(X) — > Z is a homomorphism of Abelian groups. 
Every nonzero meromorphic function / : X — > C has a divisor 

div(/)=5> x (j>. 

Let C(X) denote the field of meromorphic functions on X and let C(X)* be 
its multiplicative group of nonzero elements. Then the map 

div : C(X)* — >DW(X) 

is a homomorphism, meaning div(/i/2) = div(/i) + div(/2) for all /i,/2 G 
C(X)*. Every / € C(X)* has a divisor of degree 0, so the group Div e (X) of 
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divisors of nonzero meromorphic functions is a subgroup of the group Div (X) 
of divisors of degree 0. Abel's Theorem (see [FK80]) states to what extent the 
converse fails: the quotient Div 0 (X) /Div (X) is isomorphic to a complex torus 
C 9 /A g where g is the genus of X and A g is a lattice spanning C 9 . We will 
discuss Abel's Theorem further in Chapter 6. 

For example, let Ai denote the lattice iZ © Z in C as usual and let X be 
the complex elliptic curve C/Ai. Exercise 1.4.3 shows that (Exercise 3.4.1) 

div^/p;) = -(0 + Ai) + (1/2 + Ai) + (i/2 + A,) - ((1 + i)/2 + A t ), 

and one similarly finds div(p^ 3 /p M3 ) on C/A^ (Exercise 3.4.1 again). 
Define the linear space of any divisor D on X to be 

L(D) = {f G C(X) : / = 0 or div(/) + D > 0}. 

The idea here is that when D = div(f) for some / G C(X)*, the linear 
space L(D) is those functions / on X vanishing to high enough order to make 
the product // holomorphic. The formula 

Mfi + h) > min{^(A), u x (f 2 )}, h,h G C(X) 

(Exercise 3.4.2) shows that L(D) forms a vector space over C. This space 
turns out to be finite-dimensional and its dimension is denoted 1(D). 

Let uj G Q® n (X) (where n € N) be a nonzero meromorphic differential 
on X. Then u has a local representation uj x — f x {q)(dq) n about each point 
x ^ X, where q is the local coordinate about x. Define v x (lo) = vo(f x ) and 
then define 

div(w) = v x {lo)x. 

So div(w 1 w 2 ) = div(wi) + div(w 2 ) for any nonzero u)i G Q® n (X) and ui 2 € 
j7® m (X). A canonical divisor on X is a divisor of the form div(A) where A is 
a nonzero element of f2 1 (X). 

Theorem 3.4.1 (Riemann Roch). Let X be a compact Riemann surface 
of genus g. Let div(A) be a canonical divisor on X. Then for any divisor 
D G Div°(X), 

£(D) = dcg(D) - g + 1 + ^(div(A) - D). 

The next result gives information about the canonical divisors and a sim- 
pler version of the Ricmann-Roch Theorem for divisors of large enough degree. 

Corollary 3.4.2. Let X, g, div(A), and D be as above. Then 

(a) *(div(A)) = g. 

(b) deg(div(A)) =2g-2. 

(c) Lfdeg(D) < 0 then 1(D) = 0. 

(d) Lfdeg(D) >2g-2 then £(D) = dcg(D) - g + 1. 
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Proof. Note £(0) = 1 since div(/) > 0 only for the constant functions 
in C(X)*, and so letting D = 0 in the Ricmann-Roch Theorem gives 
1 = — g + 1 + £(div(A)), proving (a). Next set D = div(A), combine the 
Riemann-Roch Theorem with (a) to get g = deg(div(A)) — g+1 + 1, and (b) 
follows. For (c), suppose £(D) > 0. Then some nonzero / € L(D) exists, 
so div(/) > — D and taking degrees shows that deg(D) > 0, proving (c) by 
contraposition. Finally, (b) and (c) imply (d) since if deg(Z?) > 2g — 2 then 
deg(div(A) - D) < 0. □ 

We have seen in Section 3.2 that a nonzero element / € A 2 (-0 exists for 
any congruence subgroup r of SL 2 (Z). Let A = u)(f) e Q 1 (X(r)) be the 
corresponding meromorphic differential, so that div(A) is canonical and its 
degree is 2g — 2 by Corollary 3.4.2(b). For any positive even integer k, the 
meromorphic differential X k / 2 belongs to Q® k / 2 (X{r)) and its divisor has 
degree k(g - 1). Since Q® k / 2 {X{r)) = C{X{r))X k ' 2 and deg(div(/)) = 0 
for all nonzero / <G C(X(r)), it follows that every nonzero differential lo e 
Q® k l 2 (X{T)) has a divisor of degree k(g - 1) as well. 

Exercise 3.4.3 shows that if r is a congruence subgroup of SL 2 (Z) and 
g is the genus of X(r) then the vector space fl^ ol (X(r)) of holomorphic 
differentials of degree 1 on X(r) is isomorphic to the linear space L(X) for 
any canonical divisor A, a space of dimension g by Corollary 3.4.2(a). Thus 
Exercise 3.3.6 shows that dim(S2(-T)) = g- The next two sections will broaden 
the ideas of this argument to obtain general dimension formulas for Mk(r) 
and Sk(r). 

Exercises 

3.4.1. Confirm the section's formula for div(p^/pj). What is div(p^ 3 /p M3 )? 

3.4.2. Show that v x {f 1 + f 2 ) > minK(A), u x (f 2 )} for f u f 2 e C(X) where 
x is any point of the compact Riemann surface X. 

3.4.3. Recalling from the discussion immediately after Theorem 3.3.1 that 
Q 1 {X{r)) = C(X(r))\ for any nonzero A € i7 1 (X( J T)), show that the map 
J7 1 (X(_T)) — > C(X(r)) taking each / 0 A to / 0 is a vector space isomorphism 
mapping the holomorphic subspace il^ ol (X(r)) to the linear space L(X). 

3.5 Dimension formulas for even k 

We can now express the results from Section 3.3 in the language of divisors. 
Let k be any even integer. Then using formulas (3.2) and (3.3), every nonzero 
automorphic form / e Ak(r) has a well defined divisor ^2v x {f)x on X(r) 
even though / itself is not well defined on X(r). However, formula (3.2) shows 
that the coefficients in this divisor need not be integers at the elliptic points. 
To allow for divisors with rational coefficients, make the definition 
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Div Q (x(r)) = 



n x x, n x G Q for all x, n. 



0 for almost all x 



This is the free Q-module (or free vector space) on the set of points of X(r). 
Thus div(/) G DivQ(X(r')). The discussion of Div(X) as an Abelian group 
with a relation ">" and a degree function extends to DivQ(X), but the 
Ricmann Roch Theorem requires integral divisors. 

Now let the even integer k satisfy k > 2, and let / be any nonzero element 
of Ak(r) (Exercise 3.2.3 showed that such / exists). Then (Exercise 3.5.1) 



The holomorphic subspace M. k (r) of modular forms can be described in terms 
of divisors, 



where the isomorphism is a complex vector space isomorphism. If div(/) were 
integral then we could now compute dim(A^fc(-T)) with the Riemann-Roch 
Theorem, but it isn't. To approximate div(/) by an integral divisor, define 
the greatest integer function [ J on rational divisors by applying it to the 
coefficients, 



On the other hand, div(/o) is integral since /o is meromorphic on X(r), giving 
the equivalence 

div(/ 0 ) + div(/) > 0 div(/ 0 ) + |div(/)J > 0. 

This shows that M k (r) = L([div(/)J) and so dim(A4 fc (r)) = ^(|_div(/)J). 

To study the integral divisor [div(/)J let u G H m / 2 (X(r)) be the mero- 
morphic differential uj(f) from Theorem 3.3.1 that pulls back to /(r)((iT) fc / 2 
on T-L. Let {x 2 . i} be the period 2 elliptic points of X(r), {a^3,i} the period 3 
elliptic points, and {xi} the cusps. As in Section 3.1, let e 2 , £3, and be the 
sizes of these sets. Make the formal definition 



an element of DivQ(X(r)) even though rfr is not well defined on X(r). Then 
formulas (3.8) and (3.9), relating the vanishing orders of oj and /, show that 
compatibly with the identification of to and f(T)(dT) k ^ 2 and with the rule that 
the divisor of a product is the sum of the divisors (Exercise 3.5.2(a)), 



A k (r) = c(x(r))f = {f Q f : f 0 G c(x(r))}. 



M k {r) = {f 0 f G Mn ■ fof = 0 or div(/ 0 /) > 0} 

S {/o G C(X(r)) : f 0 = 0 or div(/ 0 ) + div(.f) > 0} 





div(w) = div(/) + (jfc/2)div(dr). 



(3.10) 
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Here the left side is integral, the right side a sum of rational expressions. It 
follows that (Exercise 3.5.2(b)) 



Ldiv(/)j = divM + 



X2,i 



E 



%3, 



Ek 



(3.11) 



Therefore by the discussion immediately after Corollary 3.4.2 showing that 
deg(div(w)) = k(g - 1), 



deg(Ldiv(/)J) = %-!) + 



k 




k 




£2 + 




_4_ 


3 



£3 + 2 £c 



(Exercise 3.5.2(c)) 



k , ^ k 2 k 2 k 

> 2 (2ff - 2) + — £ 2 + —£3 + 2 £ oo 

„ k - 2 /„ „ £ 2 2e 3 
= 2,-2+— ^-2+^ + ^ +£o 

> 2g — 2 + by Corollary 3.1.2, since fc > 2 
>2 3 -2. 

So |_div(/)J has large enough degree for Corollary 3.4.2(d) to apply and the 
simpler form of the Ricmann- Roch Theorem says that for even k > 2, 



dim(M fe (r)) = £(Ldiv(/)J) = (k - \){g - 1) 



For the cusp forms Sk(r), a similar argument shows that Sk(r) = 
L(|div(/) - J2i x t\) and that (Exercise 3.5.2(d)) 

dim(5 fe (r))-£(Ldiv(/)-^^j)=£(Ldiv(/)J)- £co forfc>4. (3.12) 



k 




k 




k 


4 


£2 + 


3 


£3^ 


- 2^00 



For k = 2 the divisor |_div(/) — x d works out to a canonical divisor div(A), 
cf. Exercise 3.4.2, and so its linear space has dimension g. 

Finally we dispense with the cases where the weight k is nonpositive. A 
modular form of weight 0 with respect to r is a meromorphic function on X(r) 
with no poles. Since any holomorphic map between compact Riemann surfaces 
cither surjects or maps to a single point (this was discussed in Section 1.3), 
this shows that Aio(r) ~ C (the constant functions) and So(r) = {0}. It 
follows that M k (r) = {0} for k < 0, for if / e M k (r) then f 12 A- k (where 
A is the weight 12 cusp form from Chapter 1) lies in 5o(r) = {0}. This 
argument works whether k is even or odd. 

Assembling the results of this section gives 

Theorem 3.5.1. Let k be an even integer. Let r be a congruence subgroup 
o/SL 2 (Z), g the genus ofX(r), e 2 the number of elliptic points with period 2, 
e 3 the number of elliptic points with period 3, and the number of cusps. 
Then 
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dim(M k (r)) 



'(k-l)(g-l)+ [l\ e 2 + LfJe 3 + |eco ifk>2, 
1 ifk = 0, 

0 z/fc<0, 



and 



dim(5 fc (r)) 



'(fc-l)( 5 -l)+ L|J £2 + L|Je 3 + (|-l)£co »/^>4, 
5 i/fc = 2, 

0 ifk<0. 



In concert with the results from Exercises 3.1.4 through 3.1.6 this gives 
dimension formulas for Io(p), A(p), and r(p). When r is the full modular 
group SL2(Z), the theorem is 

Theorem 3.5.2. The modular forms of weight 0 are .Mo(SL 2 (Z)) = C. For 
any nonzero even integer k < A, .M/^SL^Z)) = {0}. For any even integer 
k < A, <Sfc(SL 2 (Z)) = {0}. For any even integer k > A, 

M fc (SL 2 (Z)) - 5 fe (SL 2 (Z)) 0 CE k 

where E k is the normalized weight k Eisenstein series from Chapter 1, and 



dim(5 fe (SL 2 (Z))) = 



LtIJ-1 ifk = 2 (mod 12), 
[j|J otherwise. 



The ring of modular forms A / J(SL 2 (Z)) = ^^ fe ^ z A^fe(SL 2 (Z)) and the ideal 

of cusp forms <S(SL 2 (Z)) = ^J^^ z <Sfc(SL 2 (Z)) are a polynomial ring in two 
variables and a principal ideal, 

M(SL 2 (Z)) = C[E 4 ,E 6 ], 5(SL 2 (Z)) = A ■ M(SL 2 (Z)) 

where A is the discriminant function from Chapter 1. 

The proof is Exercise 3.5.3. Exercises 3.5.4 and 3.5.5 use the results of this 
section to describe some spaces of modular forms explicitly. 

Exercises 

3.5.1. Show that A k (r) = C(X(r))f for any nonzero / e A k (r). 

3.5.2. (a) Prove formula (3.10). 

(b) Prove formula (3.11). 

(c) Prove that for k > 2 even, [k/A\ > (k - 2)/4 and [k/3\ > (k - 2)/3. 

(d) Prove formula (3.12). Where does the condition k > A arise? 
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3.5.3. Prove Theorem 3.5.2. Write bases of 7W fc (SL 2 (Z)) for all k < 24, ex- 
pressing the basis elements as polynomials in E 4 and E 6 . (A hint for this 
exercise is at the end of the book.). 

3.5.4. (a) Let p be prime. Exercise 3.1.4(b) showed that Xo(p) has exactly 
two cusps. Use this to show that 



where G 2 , P was defined in Section 1.2. 

(b) Let k and N be positive with k(N+ 1) = 24 and k even. Thus (k, N) is 
one of (2,11), (4,5), (6,3), (8,2), or (12,1). Show that dim(S k {r Q (N))) = 1. It 
follows from Proposition 3.2.2(b) that S k (r 0 (N)) = S fe (.Ti(iV)) = C(p k where 
<^(t) = r)(T) k ri(NT) k . 

(c) What do (a) and (b) combine to say about A^2(^o(H))? 

3.5.5. Combine Exercise 3.1.4 and Theorem 3.5.2 to show that letting k = 
p+ 1 for p an odd prime, 6>fc(SL 2 (Z)) and S 2 (ro(p)) are isomorphic as complex 
vector spaces. What is the isomorphism when p = 11? (A hint for this exercise 
is at the end of the book.) 

3.6 Dimension formulas for odd k 

Throughout this section k is an odd integer. Computing dimension formulas 
for A4k(r) and S k (r) is essentially similar to the case where k is even, but it 
is a bit more technical. The formula (/[— I] k )(r) = — /(t) shows that A k (r) = 
{0} whenever — / G r, so throughout this section — / ^ T as well. This implies 
that r has no elliptic points of period 2, as shown in Exercise 2.3.7. 

Since Ai k (r) = {0} for all negative k, assume that the odd integer k is pos- 
itive. The argument in the preceding section that dim(A4 k (r)) — £([div(/)J) 
for any nonzero / € A k (r) didn't depend on k being even and thus remains 
valid here. On the other hand, the existence of such / has been established 
only for k even. Assume for now that / exists for odd k as well; this will be 
proved at the end of the section. Since k is odd and —I^T, the modular 
curve X(r) has two possible types of cusps as discussed in Section 3.2, the 
regular cusps where v ff («)(/) is integral and the irregular cusps where ^V(s)(/) 
is half-integral. For example, Exercise 3.2.1 showed that 1/2 is an irregular 
cusp of A (4). 

This time let u € n® k (X(r)) be the differential ui(f 2 ) from Theorem 3.3.1, 
pulling back to /(T) 2 (dr) fe on %. Retain the notations {2:3,1} and £3 for the 
period 3 elliptic points, and let {xi} and {x^} be the regular and irregular 
cusps, with corresponding e™ s and e 1 ^. The formal definition 



M 2 (r 0 (p)) = s 2 (r 0 ( P ))®CG 2 , : 
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is consistent with the definition of div(rfr) in Section 3.5 since there are no 
period 2 elliptic points now. By formulas like (3.8) and (3.9) but with / 2 
and k replacing / and fc/2 it gives the appropriate divisor identity div(w) = 
2div(/) + fcdiv(d"r) (Exercise 3.6.1), implying 



div(/) = Jdiv(w) + + \ Xi + ~ r 



(3.13) 



To study |_div(/)J, consider equation (3.13). At any point x {x^a} U 
{xi} U {x'j} of X{r), the order v x {f) is integral, and so \v x {uj) must be 
integral as well. At a point x = x^^, write v x (f) = m + j/3, with m and 
j integers, cf. equation (3.2). Then ^v x (ui) — m + (j — k)/3. Since v x (u) is 
integral this shows that j = k (mod 3) and ^u x {uo) € Z. Thus the integral 
part of v x (f) is ^v x {ui) + [k/3\ . Similarly, at a point x — x,, the integral part 
of v x (f) is \v x (u) + |, a sum of half-integers (Exercise 3.6.2(a)), and at a 
point x = x\, the integral part of v x (f) is \v x (uS) + a sum of integers 
(Exercise 3.6.2(b)). Thus 

Ldiv(/)j = idiv(u;)+£ 

i 

The degree satisfies (Exercise 3.6.3(a)) 

deg(Ldiv(/)J) = %-l) + 
(k - 2) (.g - 



X3,i 



2^ 0 X% 



£ 3J + 2^°° 2 £ °° 



> 



1 + 



£3 



-)+2g- 



>2g-2 if k > 3. 



So when fc > 3, Corollary 3.4.2(d) of the Riemann-Roch Theorem applies, 
giving 



*(Ldiv(/)j) = (fc-l)( ff -l) 



£3 + 



+ 



To study cusp forms, recall the general element fof € -4fc(-T) and note 
that at a regular cusp x, 



Vx(fof) > 0 ^*(/o/) > 1 

while at an irregular cusp x' , 

M/o/) >0 M/o/) > 1/2 



^(/) + ^(/o)-l >0, 



M/) + M/o)-l/2>0. 



Thus the relevant dimension is ^(|_div(/) — x -i ~ 120-/ty x 'i})- Similar to the 
calculation above, this works out for fc > 3 to (Exercise 3.6.3(b)) 
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*(Ldiv(/)-5>-E^.I) = 



2 

(k-l)(g-l) + 



This proves 



Theorem 3.6.1. Let k be an odd integer. Let r be a congruence subgroup 
of SL 2 (Z). If r contains the negative identity matrix —I then Mk(r) = 
Sk(r) = {0}. If —I $l r, let g the genus of X(r), e 3 the number of ellip- 
tic points with period 3, e™ s the number of regular cusps, and e 1 ™ the number 
of irregular cusps. Then 



dim(M fc (r)) = 

and 

dim(5 fe (r)) = J 



l)( 5 -l)+L!Je 3 + fe£ s +^ i ££ z/fc>3, 

if k < 0, 



l)(ff-l)+ [Hes+^e^+^eZ *fk>3, 

ifk<0. 



IfeZ s > 2g - 2 then dim(Mi{r)) = and dim(5i(r)) = 0. If e™ s < 

2g - 2 then dim(Mi(T)) > e^/2 and dim(Si(r)) = dim(Mi(r)) - e^/2. 

The results for k — 1 are Exercise 3.6.4. 

Regular and irregular cusps will be discussed further in Section 3.9. Here 
it remains to prove that some nonzero / € Ak(r) exists when the positive 
integer k is odd and —I £ T. The argument cites some results from Riemann 
surface theory that should be taken for granted if they arc unfamiliar. 

Let A be any nonzero element of X2 1 (X(r , )) ) for example the divisor that 
pulls back to j'(t)c?t on % where / is the derivative of the modular in- 
variant j. Let xo be any point of X(r). Then Corollary 3.4.2(b) of the 
Riemann-Roch Theorem says that div(A) — 2(g — l)x 0 e Div°, where as 
before Div° = Div°(A(_T)) denotes the divisors of degree 0. Recall also that 
Div^ denotes the divisors of nonzero meromorphic functions on X(r), a sub- 
group of Div°. Under the map Div°/Div £ — > C 9 /A g of Abel's Theorem (de- 
scribed in Section 3.4), let z + A g be the image of div(A) — 2{g — 1)xq + Div^. 
Any divisor D e Div° such that D + Div £ maps to z/2 + A g satisfies 
2D = div(A) - 2(g - l)x 0 + div(/) for some nonzero / e C(X(r)). So 

div(/A) = 2(£) + ( fl -l)a;o). 

Let / be the function such that the differential /A on X(r) pulls back to 
j{r)dT on H. Then 

div(/) = div(/A) - div(dr) 

2 

= 2(D + (g - l)x 0 ) + M + Yl Xi + E x 'i- 
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This shows that u T (f) is even for all r£M, since v T {f) = 3i/„.(t)(/) when 
7r(r) is an elliptic point x 3i , and the cusps are not in "H. So there exists a 
function /i on "H such that f\ = f. Since / is weight-2 invariant under r, f\ 
transforms by the rule (/i[7]i) = x(l)fi f° r an 7 ^ -T, where x ■ r — > {±1} 
is a character. Let f = {7 e f : x(l) = 1}) a subgroup of index 1 or 2. If the 
index is 1 then fi e A\ (r) . 

If the index is 2 then r = r' U r"a where = — A- The surjection of 

Ricmann surfaces ir : X(r') — > X(r) pulls back to an inclusion of their fields 
of mcromorphic functions, ir* : C(X(r)) — > C(X(r')), where the pullback 
of a function / G C(X(r)) is the function ir* f = f o tt e C(X(r')). After 
identifying C(X(i^)) with its image, the extension of function fields has the 
same degree degree as the original surjection, namely 2, and the Galois group 
r/r' acts as {±1} on generators of the field extension. So there exists some 
f 2 € C(X(r')) such that f 2 o a = -f 2 and thus (/i/ 2 )[a]i = /1/2, showing 

that /i/ 2 e^i(r). 

Whichever of /1 or /i/ 2 lies in _4i(T), its fcth power is a nonzero element 
of Ak(r) as desired. 

Exercises 

3.6.1. Establish the formula div(/) = 2div(/) + fcdiv(dr) in the context of 
the section. 

3.6.2. (a) Show that at a point x — Xi the integral part of v x {f) is \v x (ui) + 
k/2, a sum of half- integers. 

(b) Show that at a point x = x\ the integral part of v x (f) is \v x (uj) + (k — 
l)/2, a sum of integers. 

3.6.3. (a) Justify the calculation that deg(|div(/)J) > 2g - 2 if k > 3. 

(b) Establish the formula for i{ Ldiv(/) - Ei ^ ~ EiC 1 / 2 )^] ) whcn fc > 3 - 

3.6.4. Prove the remaining statements in Theorem 3.6.1. 

3.6.5. Show that dim (£3 (A (7)) = 1, so Proposition 3.2.2(a) shows that this 
space is spanned by the function ^(t) — rj(r) 3 r](7T) 3 . 



3.7 More on elliptic points 

Exercise 2.3.7 found the elliptic points of the groups F(N) and i~i(iV), and it 
obtained some information about the elliptic points of -Tb(iV). Exercise 3.1.4 
of this chapter found the elliptic points of r 0 (p) when p is prime. This section 
finds the elliptic points of r 0 (N) for general N. 

A point r e % is an elliptic point of r n (N) when its isotropy subgroup 
takes the form ro(N) T = (7), the matrix 7 having order 4 or 6, twice the pe- 
riod of r. For any a € r 0 (N), the translated point a(r) has the corresponding 



3.7 More on elliptic points 93 



conjugate isotropy subgroup /o(JV) a ( r ) = a(7)a: -1 . An elliptic point r 0 (N)r 
of the modular curve X 0 (N) = X(r 0 (N)) thus corresponds to a conjugacy 
class of isotropy subgroups of Jo (AT), 

r 0 (N) T <— > {a( 7 )a- 1 : a G r 0 (AO} where r 0 (JV) T = (7). 

Conjugacy classes of elements are more convenient than conjugacy classes 
of subgroups, so it is tempting to replace the subgroup (7) by its generator 
7 in the correspondence. But this is ill defined since each nontrivial isotropy 
subgroup (7) is also generated by 7 -1 . Instead each elliptic point corresponds 
to a pair of conjugacy classes, 

r 0 (N)r <— ► {{07a" 1 : a G r 0 (N)}, {cry" 1 **" 1 : a G r 0 (N)}} , 

where again rg(N) T = (7). The inverse pair of generators 7 and 7 _1 are 
not conjugate in r 0 (N) (Exercise 3.7.1(a)), so the two conjugacy classes are 
distinct and in fact disjoint (Exercise 3.7.1(b)). Thus the number of period 2 
elliptic points is half the number of conjugacy classes of order 4 elements, and 
similarly for period 3 and order 6. 

To correct the factor of a half, introduce the group 



a b 
c d 



G GL 2 (Z) : 



a b 
c d 



* * 
0 * 



(mod N) 



(where GL2(Z) is the group of invcrtible 2-by-2 matrices with integer en- 
tries), similar to -Tb(-/V) but its elements can have determinant ±1. Since the 
map det : r o (N) — > {±1} is a surjective homomorphism, Jo (AT) is normal 



in lf(N) and [rf(N) 
conjugacy class 



r 0 (N)] = 2. Thus for any 7 G r a (N) the extended 

{eryoT 1 : a G r±(N)} 

lies in r 0 (N). This is the union of two conjugacy classes under r 0 (N) (Exer- 
cise 3.7.1(c)), so the number of period 2 (or period 3) elliptic points of Jo (AT) 
is the number of extended conjugacy classes of order 4 (or order 6) elements 
of r 0 (N). The extended conjugacy class of 7 under r^(N) is not in general 
the union of the conjugacy classes of 7 and j^ 1 under Jo (AT) (see Exer- 
cise 3.7.1(d)). 

Counting the elliptic points of Jo (AT) by counting these conjugacy classes 
is done in the same environment as the proof of Proposition 2.3.3. Consider 
elliptic points r of period 3 with T 0 (A^) T = (7), 7 s = I. Let fi 6 = e 2 ™/ 6 be 
the complex sixth root of unity and let A = Z[^ 6 ], a principal ideal domain. 
Each extended conjugacy class will correspond to an ideal J of A such that 
A/ J Z/NZ as an Abelian group, 

{07a" 1 : a G r±(N)} i — ► J 7 . 



To construct the ideal J 7 from 7, note that the lattice L = Z is an A-modulc 
with the multiplication © 7 : A x L — > L given by 
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(a + b^o) 0 7 I = (al + bj)l for a, b £ Z and / £ L. 

Define Lq(N) = {[y] £ L : y = 0 (mod -/V)}, a subgroup of 1/ such that 
the quotient L/Lq(N) is isomorphic to Z/NZ. Then Lq(N) is also an A 
submodulc of L since 7 £ -To(-/V) (Exercise 3.7.2), making L/Lq(N) an A 
modulc. Let J 7 be its annihilator, 

J 7 = Ann(L/L 0 (A r )) = {a + £ A : (a + &^ 6 ) © 7 L/L 0 (N) = 0}. 

Thus A/J 7 = i0 7 L/L 0 (N) = L/L 0 (N) = Z/NZ as desired. 

To see that the ideal J 7 depends only on the extended conjugacy class 
of 7, let 7' = aja - 1 with a £ r^(N). Then 7' is also an order-6 element of 
r 0 (N), and L is also an A module associated to 7' with multiplication 

(a + bfi 6 ) 0 7 / Z = (a/ + Ir/)l. 

Let L' denote L as the Amodule associated to 7' in this fashion, and similarly 
for L' 0 (N). For any a, b £ Z and Z £ 1/ compute that 

a((a + bfie) ©7 0 = cn(al + 67)/ = (al + b-y')al since 017 = 7' a 
= (a/ + 6^i 6 ) 0 7 / aZ. 

This shows that multiplication by a gives an Amodule isomorphism L — > V , 
and since aL 0 (N) C L' 0 (N) because a £ r^(N) this induces an isomorphism 
of quotients L/L 0 (N) L'/L' 0 (N). Compute that for any j = a + b^ £ A, 

3 ^7 ^ 3 ©7 L/L 0 {N) = 0 L/Lo{N) 

<^=> a(j 0 7 L/L 0 (N)) = 0 L '/L' o (N) 
<^=> j 0y a(L/L 0 (N)) = 0 L >/L> o (N) 
j 0 7 < L'/L' 0 (N) = 0 L '/l' o (n) 

j £ Jy, 

showing that indeed J 7 depends only on the extended conjugacy class. 

Conversely to the map from conjugacy classes to ideals, start now from an 
ideal J of A with A/J = Z/NZ. By the structure theorem for modules over a 
principal ideal domain again, there exists a basis (u, v) of A over Z such that 
(u, Nv) is a basis of J over Z. Since (u, v) is a basis of A over Z, 

(n&u, /j,qv) = (ua + vc, ub + vd) for some integers a, 6, c, d, 

or 

/j,q(u,v) = (u, w)7j, for some 7,/ = [jjl £ M2(Z). 

So the matrix 77 has order 6 and has the same minimal polynomial X 2 — X+ 1 
as /U6, showing that 77 £ SL 2 (Z) (Exercise 3.7.3(a)). And since u £ J and J is 
an Asubmodulc of A, the formula fJ-eu — ua + vc shows that c = 0 (mod iV), 
i.e., 77 £ r 0 (N). Furthermore, for any a £ r^(N) consider the Z-basis of A 
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(u , v ) = (it, i>)a. 

A calculation (Exercise 3.7.3(b)) shows that (u 1 , Nv') is again a Z-basis of J. 
So now 

/u 6 (w', v') = fi 6 (u, v)a = (u, v)ja = (u' , v / )a~ 1 -y J a, 

and so the basis (u,v) of A and its -Tp ( iV)-translates map back to a Tq(N)- 
conjugacy class of an order-6 clement 7,/ in Tq(N). Finally, any two Z-bases 
(u, v) and (u' , u') of ^4 such that (u, Nv) and (u', iW) are Z-bases of J satisfy 
the relation (u 1 ,v') — (u,v)a for some a G T^(N) (Exercise 3.7.3(c)), so the 
conjugacy class is the same for any such Z-basis of A. 

Thus we have a map from extended conjugacy classes to ideals and a 
map back from ideals to extended conjugacy classes. Suppose the class of 
7 G r 0 (N) maps to the ideal J 7 of A. The proof of Proposition 2.3.3 shows 
that 7 = mT 1 [J ~\ ] to for some m — [" ^] G GL 2 (Z), and this computes out 
to 



7 = 



ac 



ad+ cd 



a 1 + ac+ c z = 0 (mod TV). 



Let (u, w) = (1, Ho)m, a Z-basis of A. A calculation using the displayed descrip- 
tion of 7 shows that u annihilates L/L n (N) (Exercise 3.7.4), as Nv clearly docs 
as well. Thus (u, Nv) is a Z-basis of an yl-submodulc J of Ann(L/Lo(A0) = 
with [A : J] = N = [A : J 7 ], showing that J — J 7 . This ideal maps back to 
the extended conjugacy class of the matrix 7' such that fia(u 7 v) = (u,v)j'. 
Compute that 

fi 6 (u, v) = /j, e (l,/j, 6 )m = (^ 6 , -1 + fi 6 )m 

= (i.Me) [1 ~{] m = (u^v)™- 1 [° ~\] m = (u, v)j, 

so J 7 maps back to the class of 7 as desired. On the other hand, suppose 
the ideal ,/ of A such that A/ J = Z/NZ maps to the extended conjugacy 
class of 7 e r 0 (N). This means that hq(u, v) — (u, v)j for some Z-basis (u, v) 
of A such that (u, Nv) is a Z-basis of J. Note that (u, v) = (1, ^@)m for some 
771 = [c d] *= GL 2 (Z). As above, [1q(u,v) = {u 1 v)m^ 1 [° ~ 1 ] to, showing that 
7 = m _1 [ 1 — 1 ] to. Again the displayed description of such a 7 shows that w 
annihilates L/L 0 (N) and consequently J = J 7 . That is, the class of 7 maps 
back to J. 

A virtually identical discussion with A = Z[i] applies to period 2 elliptic 
points (Exercise 3.7.5(a)), proving 

Proposition 3.7.1. The period 2 elliptic points of Tq(N) are in bijective cor- 
respondence with the ideals J of Z[i] such that Z[i]/J = Z/NZ. The period 3 
elliptic points of Tq(N) are in bijective correspondence with the ideals J of 
Z\p 6 ] (where ^ = e 2 ™/ 6 j such that Z[/i 6 ]/J ^ Z/NZ. 



Counting the ideals gives 
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Corollary 3.7.2. The number of elliptic points for r 0 (N) is 

£2 (r 0 (AO) = ( n ^ (1 + (# lf ^ N > 

[O if A | N, 

where (— 1/p) is ±1 if p = ±1 (mod 4) and is 0 if p = 2, and 

e 3 (W)) = { n ^Mf)) W"' 
[0 if9\N, 

where (— 3/p) is ±1 if p = ±1 (mod 3) and is 0 z'/p = 3. 

These formulas extend Exercise 2.3.7(c) and Exercise 3.1.4(b,c). 

Proof. This is an application of beginning algebraic number theory; see for 
example Chapter 9 of [IR92] for the results to quote. For period 3, the ring 
A = Z[^i 6 ] is a principal ideal domain and its maximal ideals are 

• for each prime p = 1 (mod 3), two ideals J p = (a + 6/ig) and J p = (a + 
bfl 6 ) such that (p) = JpJp and the quotients A/J p and A/J e p are group- 
isomorphic to Z/p e Z for all e <G N, 

• for each prime p = — 1 (mod 3), the ideal J p = (p) such that the quotient 
A/Jp is group-isomorphic to (Z/p e Z) 2 for all e € N, 

• for p = 3, the ideal J3 = (1 + p 6 ) such that (3) = Jf and the quotient ^4/ Jf 
is group-isomorphic to (Z/3 e / 2 Z) 2 for even e € N and is group-isomorphic 
to Z/3< e+1 )/ 2 Z 0 Z/S^-V^Z for odd e e N. 

The formula for £ 3 (ro(^V)) now follows from Proposition 3.7.1 and the Chi- 
nese Remainder Theorem. Counting the period 2 elliptic points is left as Ex- 
ercise 3.7.5(b), similarly citing the theory of the ring A — Z[i]. □ 

The elliptic points of r 0 (N) can be written down easily now that they are 
counted. Consider the set of translates in H 

(Ms) : 0 < n < JvJ . 
The corresponding isotropy subgroup generators °] [_ n 1] axe 

: 0 < n < ivj . 

The number of these that are elements of Jo(iV) is the number of solutions 
to the congruence x 2 — x + 1 = 0 (mod N), and this number is given by 
the formula for £3 (To (N)) in Corollary 3.7.2 (Exercise 3.7.6(a)). The cosets 
{r Q (N) [i J] : 0 < n < N} are distinct in the quotient space r 0 (A^)\SL 2 (Z), 
though they do not constitute the entire quotient space, and the corresponding 




f n -1 
\ n 2 — n+ll-n 
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orbits r 0 (N) [ * °] for such n such that n 2 — n + 1 = 0 (mod N) are 
distinct in X 0 (N) (Exercise 3.7.6(b)). Thus we have found all the period 3 
elliptic points of Jo (AT) (Exercise 3.7.6(c)), 

Jo (AT) " + M3 - , n 2 -n+l = 0(modA^). (3.14) 
— n + 1 

Similarly, the period 2 elliptic points are (Exercise 3.7.6(d)) 

Fa{N) i^Tl' n 2 + l = 0 (modJV). (3.15) 

Exercises 

3.7.1. (a) Show that if 7 generates a nontrivial isotropy subgroup in SL2(Z) 
then 7 and 7" 1 are not conjugate in GL^(Q). (Hints for this exercise are at 
the end of the book.) 

(b) Show that any two conjugacy classes in a group are either equal or 
disjoint. 

(c) Show that the J^AQ-conjugacy class of 7 G Jo(A r ) is the union of the 
Jo(A r )-conjugacy classes of 7 and [jjj -1] 7 [0 -?]■ Show that if 7 has order 4 
or 6 then this union is disjoint. 

(d) Let 7 = [? -J] [^F 1 = [5 Z§] , an ordcr-4 clement of T 0 (5). 
Show that 7 is not conjugate to its inverse in J n 0 ± (5). 

3.7.2. In the context of mapping a matrix conjugacy class to an ideal, show 
that L 0 (N) is an A-submodule of L. 

3.7.3. (a) In the context of mapping an ideal J of A such that A/ J = Z/NZ 
back to a matrix conjugacy class, retain the notation (u, v) for a Z-basis of A 
such that (u,Nv) is a Z-basis of J. Show that the matrix jj £ M 2 (Z) such 
that hq(u,v) = (u,v)-fj lies in SL 2 (Z). 

(b) For any a £ r^(N) consider the Z-basis (u',v') = (u,v)a of A. Show 
that (u',Nv') is again a Z-basis of J. 

(c) Show that any two such Z-bases (u, v) and (u',v') of A satisfy the 
relation («', v') = (u, v)a for some a £ r^(N). 

3.7.4. In the context of checking that the maps between conjugacy classes 
and ideals invert each other, let 7 £ Jo (AT) of order 6 be given and define 
(u,v) = (l,[i(i)m as in the section. Show that u 0 7 L C L 0 (N), so that u 
annihilates L/L 0 (A r ). (A hint for this exercise is at the end of the book.) 

3.7.5. (a) Similarly to the methods of the section, check that the first half of 
Proposition 3.7.1 holds. 

(b) Prove the first half of Corollary 3.7.2. (A hint for this exercise is at the 
end of the book.) 
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3.7.6. (a) Show that the number of solutions to the congruence x 2 — x + 1 = 
0 (mod N) is given by the formula for e^(r 0 (N)) in Corollary 3.7.2. (A hint 
for this exercise is at the end of the book.) 

(b) Show that the orbits r 0 (N) [ * °] (/x 3 ) for n = 0, . . . , N - 1 such that 
n 2 — n + 1 = 0 (mod N) are distinct in X 0 (N). 

(c) Confirm formula (3.14). 

(d) Similarly show that the period 2 elliptic points of r 0 (N) are given by 
formula (3.15). 

3.7.7. Let p e and M be positive integers with p prime, e > 1, and p \ M . Let 
m = p^ 1 (mod M), i.e., rap = 1 (mod M) and 0 < m < M. Consider the 
matrices 

1 0" 



a,- 



Mj 1 



and 

Show that 



1 {mjp- 1)/M 



0 < j < P e 
= m + jM, 0<j< p^ 1 . 



r 0 ( P e M)\r 0 (M) = \Jr 0 ( P e M) aj u \Jr Q {p e M)^ 



Iterating this construction gives a set of representatives for r , 0 (A r )\SL 2 (Z) 
where N — \\p e , including the representatives used at the end of the section 
to find the elliptic points. For example, if representatives for rQ(p e qf)\r$(qf) 
are {ao, . . . , a p e_ lt /3 0 , . . . , P p e-i-i} and representatives for ro(<^)\SL 2 (Z) 
are {a' Q , a' g/ _ 1 ,/3o, • ■ • ,0^-1^} then the p e (l + l/p)q f (l + l/q) products 
{ajce'j,, aj/3'j,, PjCt'j,, (3j(3'ji} are a set of representatives for 7 n o(p e 9^)\SL 2 (Z). 



3.8 More on cusps 

This section describes the cusps of the congruence subgroups r(N), Ii(iV), 
and To (AT). When N = 1, these groups are SL 2 (Z) and the only cusp is 
SL 2 (Z)oo, so throughout the section let N > 1. 

The first result states that the familiar action of SL 2 (Z)/{±J} on QU{oo} 
comes from the action of SL 2 (Z) on Q 2 . 

Lemma 3.8.1. Let s — a/c and s' = a! jd be elements o/QU {oo}, with 
gcd(a,c) = gcd(a',c') = 1. Then for any 7 G SL 2 (Z), 



s' = 7(s) 



±7 
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Proof. Assume that c^OandcV 0. Then for 7 = [ p r q t ] e SL 2 (Z), 

pa + qc 



s' = 7(s) 



a 



ra + tc 



Both fractions on the right side are in lowest terms (Exercise 3.8.1(a)), and 
so the result follows in this case. The cases when at least one of c, d is zero 
are left as Exercise 3.8.1(b). □ 

The notation from the lemma will remain in effect throughout the section. 
Thus s — a/c and s' = a! jd are elements of Q U {00} expressed in lowest 
terms, so that ["] and [ a , ] are elements of Z 2 with gcd(a, c) = gcd(a', d) = 1. 
The gcd condition automatically excludes the zero vector. If ["] = 7 ] for 
some 7 e r(N) then taking the relation modulo N gives the congruence 
["] = [ a , ] (mod AT). The next result, that the converse holds as well, is the 
small technical point needed to describe the cusps. 

Lemma 3.8.2. Let ["] and [" ] be as above. Then 

for some 7 G T(N) °, = ° (mod AT). 



and let 7 



"a'~ 




a 




= 7 




c' 


c 



Proof. " =>■ " is immediate, as just noted. For " ^= " first assume [°] = [J]. 
Then a' = 1 (mod iV). Take integers /3 and <5 such that a! 5 - d/3 = (1 - a')/iV 

"a' /3JV 



. Then 7 e r(N) and [«'] = 7 [ %\ in this case. 



In the general case there exist integers b and d such that ad — be = 1 . Let 



a 



' ab~ 
.c d. 



G SL 2 (Z). Since a[J] 



it follows that a 



[J] working modulo iV. Thus a -1 [ a , ] = 7'a _1 [°] for some 7' e r(N), as in 
the preceding paragraph. Since -T(A^) is normal in SL 2 (Z), setting 7 = ar( 'or 1 
completes the proof. □ 

Proposition 3.8.3. Let s = a/c and s' = a! jd be elements of Q U {00} with 
gcd(a,c) = gcd(a',c') = 1. Then 



r(N)s' = r(N) s 





'a!' 


= ± 


a 




d 








c 



snrf 



'a'" 


= ± 


a + jc 


c'_ 




c 



and 



r 0 (N)s' = r 0 (N) s 



yd 




a + jc 


d 




yc 



(mod AO, 



(mod N) for some j, 



(mod N) for some j, y. 



Ln the second and third equivalences j can be any integer, and in the third 
equivalence y is any integer relatively prime to N . 
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Proof. For the first equivalence, 

r(N)s' = r(N)s -^=> s' = 7(s) for some 7 e r(N) 

[£,'] = ±7 ["] for some 7 G r(AT), by Lemma 3.8.1 
[«,'] =±[°] (mod iV), by Lemma 3.8.2. 

The decomposition A(^0 = Uj A^O [01] re d uces the second equivalence 
to the first, cited at the last step of the computation 

A(JV)s' = A(iV)s <^=> s'eA^s 

<*=^ s' e r(AT) [ 0 j ] s for some j 

r(AT)s' = r(AT)(s + j) for some j 
[«,'] = ± [ a +J' c ] (mod AT) for some j. 

Similarly, the decomposition A (AO = \J A(N) [ jv , taken over y relatively 
prime to A^ and xy — fcTV = 1, reduces the third equivalence to the second, 
cited at the third step of the computation 

r a {N)s' = r 0 (N)s <^=> s' e A (AO [jv J] * for some y 

A (AOs' = A (AO ^ Q + fcc for some y 
Aa + yc 

[»,'] = ± [* Q + fc y c +^ c ] (mod A^) for some j, y 
= [ a Jc C ] (mod AT) for some j, y, 

where the "±" has been absorbed into y and some constants into j at the last 
step. □ 

Thus the cusps of r(N) are r(N)s, s = a/c, for all pairs ± ["] (mod N) 
where gcd(a, c) = 1. Condition (3) of the next lemma provides a characteri- 
zation purely modulo A^ for these representatives. 

Lemma 3.8.4. Let the integers a,c have images a,c in Z/NZ. Then the fol- 
lowing are equivalent: 

(1) [|] has a lift [ a ',] e Z 2 with gcd(a',c / ) = 1, 

(2) gcd(a,c,J\0 = l, 

(3) [|] has order N in the additive group (Z/NZ) 2 . 

Proof. If condition (1) holds then k(a + sN) +l(c + tN) = 1 for some integers 
k, I, s, t, and so ka + lc+ (ks + lt)N = 1, giving condition (2). 

If condition (2) holds then ad—bc+ kN = 1 for some b, d, and k, and the 
matrix 7 = [« b d ] e M 2 (Z) reduces modulo AT into SL 2 (Z/NZ). Since SL 2 (Z) 
surjects to SL 2 (Z/A r Z), there is a lift € SL 2 (Z), giving condition (1). 

(The reader may remember this argument from the proof of Theorem 1.5.1.) 
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Condition (3) is the implication (fc[§] = 0 in (Z/NZ) 2 AT | fc), 

or equivalently (AT | ka and A" | kc A~ | A;), or cquivalcntly (A" | 

/cgcd(a, c) =>■ AT | fc), or equivalently gcd(gcd(a, c), AT) = 1, or equiva- 
lently gcd(a, c, AT) = 1, i.e., condition (2). □ 

By the equivalence of conditions (1) and (3), the cusps of r(N) are now 
described by the pairs ± ["] of order A" in (Z/NZ) 2 . The bijection is 

± [|] h-> r(N)(a'/c), where [ a J] is a lift of [|] with gcd(o',c') = 1. 

Using condition (2) of Lemma 3.8.4 we can count the cusps of r(N). For 
each c e {0, 1, . . . , N — 1} let d — gcd(c,N). Then the number of values 
a e {0,1,..., AT — 1} such that gcd(a, c, N) — 1 is the number of values 
such that gcd(a, d) = 1, or (N/d)(j)(d) (Exercise 3.8.2(a)). So the number of 
elements of order N in (Z/NZ) 2 is (considering only positive divisors here 
and elsewhere in this section) 

Y,{N/d)cf>{d) ■ \{c : 0 < c < N, gcd(c, N) = d}\ 

d\N 

= j2( N /d)m<t>(N/d) 

d\N 

= N 2 Y[( l - VP 2 ) (Exercise 3.8.2(b)). 

p\N 

For N > 2 each pair ± ["] of order AT elements in (Z/NZ) 2 has two distinct 
members since an element equal to its negative has order 2. For N = 2 the 
"±" has no effect. Thus the number of cusps of r(N) is 



Ul/2)N 2 U PIN (1-1/P 2 ) XN>2, 
1 3 if AT = 2. 



Deriving this formula doesn't really require first describing the cusps 
of r(N) explicitly Since r(N) is normal in SL2(Z), its cusps all have the 
same ramification degree over SL2(Z)oo, most easily computed by applying 
the formula from Section 3.1 at oo, 

erwco = [SL 2 (Z) co : {±I}r(NU = [± ([J {}) : ± ([$ ?]>] = N. 

Since -I <= SL 2 (Z) and -I <= r(N) only for N = 2 (recall that AT > 1), and 
since [SL 2 (Z) : r(N)] — N 3 Y[ p \ N (l — 1/p 2 )> the projection of modular curves 
X(N) — > X(l) has degree 



/p 2 ) if AT > 2, 
if N = 2, 
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cf. Exercise 1.2.2(b). Now the formula e (x> (r(N)) — d^/N reproduces the 
result. 

But since A (TV) is not normal in SL 2 (Z), its cusps must be counted di- 
rectly. The representatives are pairs ± ["] of order TV vectors modulo TV, but 
now by Proposition 3.8.3 a cusp determines the upper entry a modulo the 
lower entry c, so in fact a is determined modulo gcd(c, TV). This time, for each 
c e {0, 1, . . . , TV — 1}, letting d = gcd(c, TV), the number of values a (mod d) 
such that gcd(a, d) = 1 is just 4>(d). So the number of elements of in (Z/TVZ) 2 
that pair to describe cusps of fi(N) is J2d\N <t>{d)4>{N/d)- 



a = 


0 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


= 0 


■0---0-0---0 


1 


0 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


2 


• 0 


• X 


• X 


• X 


• X 


X 


3 


• 0 0 




X 


X 




X 


X 




X 


X 


4 


■0-0 




X 




X 




X 




X 


5 


0 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


= 6 




0 








X 


• X • • • 0 


7 


0 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


8 




X 




X 




X 




X 


■0-0 


9 




X 


X 




X 


X 




X 


X 


• 0 0 


10 


• X 


• X 


X 


X 


X 


• 0 


11 


0 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 



Figure 3.2. The cusps of T(12) and of A (12) 



The table in Figure 3.2 shows all of this for N = 12. The elements [%\ of 
order 12, indicated by "x", pair under negation modulo 12 to represent the 
cusps of r{12). In each row, one element is circled for each value of a modulo 
gcd(c, N), describing the cusps of A(12). The circled elements from the top 
row, where c = 0, pair in an obvious way, and similarly for the left column, 
where a = 0. When ac ^ 0 the circled elements are taken from the left cell 
of length gcd(c, N) on row c when c < 6 and from the right cell when c > 6, 
and these visibly pair. On the middle row, where c = 6 (and more generally 
where c = N/2 when TV is even), the pairing of circled elements is a bit more 
subtle: the left <p(N/2)/2 elements in the left half of the row pair with the 
right 4>(N /2)/2 in the right half unless 4>(N/2) is odd. But <j>(N/2) is odd only 
for TV = 2, when the whole pairing process collapses anyway, and for TV = 4, 
when only four of the five "x"'s in the relevant table pair off, leaving three 
representatives (Exercise 3.8.3). Thus the number of cusps of A(TV) is 

!2 if TV = 2, 

3 if TV - 4, 

sEdlJV^WWd) if TV - 3 or TV > 4. 
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The sum doesn't have as tidy a product form as the corresponding sum 
for r(N) despite appearing simpler (this reflects that ^(N) is not normal 
in SL 2 (Z)), so we leave it as is. 

To count the cusps of r 0 (N) recall from Proposition 3.8.3 that for this 
group, vectors ["] and [°, ] with gcd(a, c) = gcd(a',c') = 1 represent the 

same cusp when y ", = [ a yl c \ (mod N) for some j and y with gcd(y, N) = 

1. The bottom condition, d = yc (mod N) for some such y, is equivalent 
to gcd(c',iV) = gcd(c, N), in which case letting d = gcd(c,N) and letting 
i/o £ Z satisfy yo = e'e^ 1 (mod N) makes the condition equivalent to y = 
y a + iN/d (mod N) for some i (confirming the calculations in the paragraph 
is Exercise 3.8.4). For any divisor d of N, pick one value c modulo N such 
that gcd(c, N) = d. Then any cusp of rb(iV) represented by some vector [°, ] 
with gcd(c',iV) = d is also represented by ["] whenever (y 0 + iN/d)a! = 
a + jc (mod N) for some i and j, or a = y 0 a' (mod gcd(c, N, a'N/d)), or 
a = y a a' (mod gcd(d, N/d)). Also, a is relatively prime to d since gcd(a, d) \ 
gcd(a,c) — 1, so a is relatively prime to gcd(d, N/d). Thus for each divisor d 
of N there are <p(gcd(d, N/d)) cusps, and the number of cusps of r 0 (N) is 

£oo(/ n o(iV)) = ^0(gcd( ( i,7V/d)). 

d\N 

The reader should use this discussion to find the cusps of -T 0 (12) with the help 
of Figure 3.2 (Exercise 3.8.5). 

Let N be any positive integer and let k be odd. All cusps of ro(N) are 
regular since — / e r n (N). To study the cusps of I\(iV), take any s G QU{oo} 
and let 7 £ r\(N) s , so if a £ SL 2 (Z) takes 00 to s then a~ 1 -fa £ (a~ 1 ra) QO , 
If a~ 1 -fa takes the form — [J \\ then 7 has trace —2. But also, traced) = 
2 (mod N), so this can happen only if N | 4. Since — J e A (2), the only case 
where an irregular cusp might occur is I\ (4). This argument shows that all 
cusps of r(N), N 7^ 4, are regular as well. In fact, the only irregular cusp in 
this context turns out to be the example we have already seen, s — 1/2 for 
A (4) (Exercise 3.8.7). 

Finally, the cusps of any congruence subgroup of SL 2 (Z) have a purely 
group-theoretic description. Let G be an arbitrary group and let Hi and H 2 
be subgroups. A double coset of G is a subset of G of the form HigH 2 - The 
space of double cosets is denoted H\\G/H2, that is, 

H{\G/H 2 = {H l9 H 2 : g £ G}. 

As with one-sided cosets, any two double cosets are disjoint or equal, so there 
is a disjoint set decomposition 

G = H 1 gH 2 where R is a set of representatives. 

gen 

A specific case of all this describes the cusps of any congruence subgroup. 
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Proposition 3.8.5. Let r be a congruence subgroup o/SL 2 (Z) and let P be 
the parabolic subgroup of SL 2 (Z), P = {± [J^] : j G Z} = SL 2 (Z) co , the 
fixing subgroup of go. Then the map 

P\SL 2 (Z)/P — ► {cusps ofT} 

given by 

PaP i— > Pa(oo) 
is a bijection. Specifically, the map is r [° ^] P i-> P(a/c). 

Proof. The map is well defined since if PaP = Pa'P then a' = 7a<5 for some 
7 G P and <5 G P, so that Pa'(oo) = P7a<5(oo) = Pa(oo), the last equality 
because 7 G P and (5 fixes 00. 

The map is injective since the condition Pa'(oo) = Pa(oo) is equivalent 
to a'(oo) = 70(00) for some 7 G P, or a - ^ -1 ^' G P for some 7 G P, or 
a' e PaP, meaning Pa'P = PaP. 

The map is clearly surjective. □ 

Another proof, essentially identical, is to show first that SL 2 (Z)/P identi- 
fies with Q U {00}, so that the double coset space P\SL 2 (Z)/P identifies with 
the cusps P\(Q U {00}). 

In particular, the double coset space P(7V)\SL 2 (Z)/P is naturally viewed 
as SL 2 (Z/iVZ)/P where P denotes the projected image of P in SL 2 (Z/7VZ), 
that is, P = {± : j € Z/NZ}. Thus double cosets r(N)aP with a G 
SL 2 (Z) can be viewed as ordinary cosets aP with a € SL 2 (Z/iVZ). Since P 
acts from the right by 



a b 


1 y ± 


~ab-\ 


-ja 


c d 


c d- 




\-jc_ 



the cosets are represented by pairs ± [ " ] where the vector is the left column of 
some a G SL 2 (Z/iVZ) as before. Similarly, when P = A (AT) = Uj [0 1] r (^)' 
the double coset space is naturally viewed as P + \SL 2 (Z/7VZ)/P where 
f+ = { [ 0 1 ] : J G Z} is the "positive" half of P and again the overbar signifies 
reduction modulo N. Since [J 1] [c *] — [ a+ J c *] this recovers the description 
of the cusps of P 1 (A r ). The double coset decomposition P 0 (./V)\SL 2 (Z)/P re- 
covers the cusps of ro(N) in the same way. See Section 1.6 of [Shi 73] for an 
elegant enumeration of the cusps of Po(A r ) using double cosets. 

In Chapter 5 of this book double cosets will define the Hecke operators 
mentioned in the preface. For future reference in Chapter 5 we now state that 
for any prime p, 



A (AO 



1 0 
Op 



Pi(Af) = 7 £M 2 (Z) : 7 = 



1 * 
0 p 



(mod N), det-f = p 



(3.16) 
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Supply details as necessary to the following argument (Exercise 3.8.8(a)). 
One containment is clear. For the other containment, let L — Z 2 and let 
L 0 = L 0 (N) = {[ x y ] G L : y = 0 (mod N)}. Then M 2 (Z) acts on L by 
left multiplication. Take 7 G M 2 (Z) such that 7 = [J *] (mod TV) and det7 = 
p. Because 7 € M 2 (Z) and 7 = [ q * ] (mod TV) , 7L0 C Lq . Because det 7 = p is 
positive, [i : 7L0] = [L : L 0 ] [L 0 : 7X0] = ATp. By the theory of Abelian groups 
there exists a basis {u, v} of L such that det[u v] = 1 and -fL 0 = muZ © nvZ 
where 0 < m \ n and ran = Np. The first column of 7 is [ a 0 " ] (mod N) , but it 
is also 7d (where ei is the first standard basis vector) , an element of -fL 0 and 
therefore [§] (mod to). Because gcd(a 7 , iV) = 1, also gcd(m, N) = 1. Because 
p is prime it follows that to = 1 and n = Ap, 

7X0 = uZ® NpvZ. 

The right side has unique supergroups of index p and TV inside L = uZ 0 vZ, 
so now 

L 0 = uZ® NvZ, jL = uZ® pvZ, jL 0 = uZ 0 iVpwZ. 

Let 71 = [««]. The condition u G i 0 shows that 71 G Jb(jV). Let 72 = 

(71 [J °]) 1 7, an element [" ^] of GLj(Q) with determinant 1 such that as 
desired, 

"1 0" 



7 = 7i 



Op 



72- 



(3.17) 



The condition je\ G jLq is au + cpv G uZ ® NpvZ, or a € Z and c € 7VZ. 
The condition 7e 2 G 7L is ub + pvd G uZ®pvZ, or 6, G Z. Thus 7 2 G r 0 (iV) 
as well. Because a 7 = 1 (mod iV), 7ei = ei (mod iV) and thus (3.17) shows 
that awi = 1 (mod N) using only that 71,72 G r 0 (N) where 7 2 = [" J] . So 
if 7 satisfies (3.17) for 71,72 G r 0 (N) such that either 71 or 7 2 lies in A(-V) 
then both do. To complete the argument it suffices to show that 



To (AO 



1 0 
Op 



r 0 (N) = A (TV) 



1 0 
Op 



r 0 (N). 



For the nontrivial containment, since ri(N)\r 0 (N) is represented by matrices 
of the form Jj] G SL 2 (Z), it suffices to show that for each such matrix there 
exists a matrix S G A (A 7 ") such that *] [J °] r 0 (JV) = 5 [Jp] A^), or a 
matrix <5' G A (A 7 ) such that 



"1 0" 


-1 


a k 




'1 0" 


Op 




N d 




Op 



If p \ N then 5' = works. If p f A 7 then any 5' = 

where d' = 1 (mod A 7 ") and <f = —a (mod p) will do. This completes the 
proof. Exercise 3.8.8(b) requests a similar description of the double coset 

r 0 (N)[h° p ]r Q (N). 
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Exercises 

3.8.1. (a) In the proof of Lemma 3.8.1 show that the fraction (pa+qc) / (ra+tc) 
is in lowest terms. (A hint for this exercise is at the end of the book.) 

(b) Complete the proof of Lemma 3.8.1. 

3.8.2. (a) Let d be a divisor of N. Show that the number of values a modulo N 
such that gcd(a, d) = 1 is (N/d)<p(d). 

(b) Show that Ed\ N ( N /d)Hd)HN/d) = N 2 U P \ N ( l ~ VpO- (A hint for 
this exercise is at the end of the book.) 

3.8.3. Make a table counting the cusps of -T(4), of A (4), and of r 0 (4). 

3.8.4. Confirm the calculations that count the cusps of r 0 (N). 

3.8.5. Find the cusps of r 0 (12). 

3.8.6. Find the cusps of -T(IO), of A (10), and of r 0 (10). 

3.8.7. Show that then when k is odd, all cusps of -T(4) are regular and only 
the cusp s — 1/2 is irregular for A (4). (A hint for this exercise is at the end 
of the book.) 

3.8.8. (a) Supply details as necessary to the proof of (3.16) in the section, 
(b) Analogously to (3.16), describe the double coset r 0 (N) [J °] r 0 (N) as 

a subset of M 2 (Z). 



3.9 More dimension formulas 

This section computes dimension formulas for Aik(r) and Sk(r) when r is 
r 0 (N), A (AO, or r(N). Since we already have formulas for SL 2 (Z), let N > 1 
throughout this section. 

Recall that X{N) = X(r(N)), X 1 (N) = X(A(7V)), and X 0 {N) = 
X(r 0 (N)). Projection X(N) — > X-i(N) has the degree computed in the 
previous section, 

d{ r { N )) = d N = \^ N3 ^ 1 - 1 ^ if7V>2 < 

y y " 1 6 if AT = 2. 

Since -I e A(-/V) if and only if -/ e r(N), and since the index [A (AT) : 
r(N)} is N, projection X^N) — > X(l) has degree 

d(A(JV)) = d N /N for N >2. 

In discussing A (AT) we may take N > 2 since A (2) = A (2). So -I <£ A (AO 
while -7 e SL 2 (Z). Along with the index [r 0 (N) : A (A/')] = 0(7V) this shows 
that projection X 0 (N) — > X(l) has degree 
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d(r 0 (N)) = 2d N /(N<j>(N)) for N > 2. 

By Exercise 2.3.7(a), e 2 (r(N)) = s 3 {r(N)) = 0 for TV > 1. By Exer- 
cise 2.3.7(b), 

Finally, S2(r 0 (N)) and S3(r 0 (N)) were computed in Section 3.7. The data are 
summarized in Figure 3.3. 
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2d N 
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Figure 3.3. Data for the dimension formulas 



For r 0 (N), the formula for the genus g from Theorem 3.1.1 and the 
dimension formulas for A^fc(i~b(A 7 ')) and 5/ s (rb(A r )) when k is even from 
Theorem 3.5.1 don't simplify and are best left in their given forms. (Since 
-I e r 0 (N) the dimensions are 0 for k odd.) But for A (AT) and r(AT), see 
Figure 3.4. In the table, each "±" refers to the dimensions for the modular 
forms and the cusp forms respectively. The last column is split when the for- 
mulas differ for k even and k odd. Since dim(S 2 (-0) = 9 an d dim(.Mo(-0) = 1 
and dim(5o(r)) = 0 and dim(7Wfc(r)) = 0 when k < 0, the only case not re- 
solved by the table is k = 1. 



1 UN = 3, 

0 if N = 2 or N > 3. 
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r 


9 


dim(M fe (T)) for fc > 2 
and dim(5fe(r)) for fc > 3 


(fc even) 


(fc odd) 


A(2) 
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0 


A(3) 
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fc- 1 ±3 
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fc- 1 ±2 
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A (iV) 
(N > 4) 


d\N 


d|iV 


r(2) 


0 
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Figure 3.4. Genera and dimensions for most values of k 



Exercises 

3.9.1. Verify all the entries in Figure 3.4. (A hint for this exercise is at the 
end of the book.) 

3.9.2. Theorem 3.6.1 provides the dimension formulas dim(A^i(r)) — eJJ, s /2 
and dim(<Si(-T)) = 0 when > 2g — 2. Show that this condition holds 
for r(N) if and only if N < 12. 

3.9.3. Show that r 0 (N) = {±I}r 1 (N) for = 2,3,4. What does this show 
about dimA y Jfc(-To(A^)) and dim Sk(r 0 (N)) for these values of A^? As a special 
case of this, note the formula dim(A / J2(^o(4))) = 2 cited in the context of the 
four squares problem in Section 1.2. 



3.9.4. Show that T(2) = 
plain about Figure 3.4? 



0 1/2 
-1 0 



{±7}A(4) 



0 1/2 
-1 0 



. What does this ex- 



3.9.5. For what values of N does the genus g(X(N)) equal 0? 1? 
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Eisenstein Series 



For any congruence subgroup r of SL 2 (Z), the space Mk(r) of modular forms 
naturally decomposes into its subspace of cusp forms Sk(r) and the cor- 
responding quotient space Mk(r)/Sk(r), the Eisenstein space £fc(T). This 
chapter gives bases of £k(r(N)), £k(r\(N)), and subspaces of £k(r\(N)) 
called eigenspaces, including £k(r 0 (N)). The basis elements are variants of 
the Eisenstein series from Chapter 1. For k > 3 they are straightforward to 
write down, but for k = 2 and k = 1 the process is different. 

Aside from demonstrating explicit examples of modular forms, computing 
the Fourier expansions of these Eisenstein series leads naturally to related 
subjects that are appealing in their own right: Dirichlet characters, zeta and 
L- functions, their analytic continuations and functional equations, Bernoulli 
numbers and Bernoulli polynomials, Fourier analysis, theta functions, and 
Mcllin transforms. These ideas are presented in context as the need arises. 
They help to show that a more general kind of Eisenstein series, augmented 
by a complex parameter, satisfies a functional equation as well. At the end 
of the chapter they are used to construct a modular form that is related to 
the equation x 3 = d and to the Cubic Reciprocity Theorem as the motivating 
example in the book's preface is related to x 2 = d and Quadratic Reciprocity. 

Related reading: Various parts of this material are covered in the texts 
[Gun62], [Hid93], [Kob93], [Miy89], [Lan76], [Sch74], and in sections of the 
papers [Hec27] and [Hid86]. 

4.1 Eisenstein series for SL 2 (Z) 

The Eisenstein series Gk (t) for even k > 4 were defined in Chapter 1 , as were 
the normalized Eisenstein series Ek(r) = Gk(r) /(2C t (k)) with rational Fourier 
coefficients and leading coefficient 1. Recalling that a primed summation sign 
means to sum over nonzero elements, compute that 
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C "«= E' 777^ = E E 



, (cT + d) k ^ ^ (cT + d) k 

(c,d)eZ 2 v ; n=l (c,d) v ; 

gcd(c,d)=n 

°° 1 1 1 

= Z ( C T + d) fe " C(fc) Z ( CT + d)* • 

n=l (c,d) V ; (c,d) V ; 

gcd(c,d) — 1 gcd(c,d)=l 

These calculations are valid since the series Gfc(r) converges absolutely for all 
integers k > 3, as shown in Exercise 1.1.4. It follows that 

«M-i E («> 

( c ,d)ez 2 v ; 

gcd(c,d) = l 

The series defining and E/, cancel to zero for odd k. 

Define P + = {[J "] : ™ € Z}, the positive part of the parabolic subgroup 
of SL 2 (Z). This allows an intrinsic description of the normalized Eiscnstein 
series (Exercise 4.1.1), 

Ek{r) = \ Y, 3{l,r)- k . (4.2) 
7 eP+\SL 2 (z) 

To show intrinsically that E k is weakly modular of weight k, compute that 
for any 7 e SL 2 (Z) 

{E k [ 1 ]k){r)=\j{ 1 ,T)- k J2 i(7',7(r))- fe , 

7'GP+\SL 2 (Z) 

so the relations j{i ,~({t)) = i(l'7, r)/j(7, r) and P+\SL 2 (Z)7 = P+\SL 2 (Z) 
show that the right side is again E k (r). This argument isn't really different 
from the nonintrinsic proof from Section 1.1 (and they both require the ab- 
solute convergence shown there in Exercise 1.1.2 to rearrange the sum), but 
the intrinsic method is being shown for elegance here and in the next section 
and because it remains tractable in more general situations. 

This chapter will construct Eiscnstein series for congruence subgroups 
of SL 2 (Z). For any congruence subgroup E and any integer k, define the 
weight k Eisenstein space of E to be the quotient space of the modular forms 
by the cusp forms, 

s k (r) = M k (r)/s k (r). 

Recall from Chapter 3 that denotes the number of cusps of the compact 
modular curve X(E) and e^ g denotes the number of regular cusps. By the 
dimension formulas from Chapter 3 (Exercise 4.1.2), 



dim(£ fc (r)) 
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£oo if fc > 4 is even, 

e£ s if fc > 3 is odd and -/ £ r, 

eco-1 if* = 2, 

e^ g /2 if fc = 1 and -7 <£ T, v ; 

1 if fc = 0, 

0 if fc < 0 or if fc > 0 is odd and -7 e T. 



At the end of the next chapter the Eisenstein space £k{r) will be redefined 
as a subspace of Mk{r) complementary to Sfc(T'), meaning the subspaces are 
linearly disjoint and their sum is the full space. In the meantime it suffices to 
think of £k{r) as a quotient space. 

Exercises 

4.1.1. Show that the intrinsic sum for E^ in (4.2) agrees with the sum in (4.1). 

4.1.2. Confirm formula (4.3). 

4.2 Eisenstein series for T{N) when k > 3 

Let iV be a positive integer and let v G (Z/7VZ) 2 be a row vector of order N. 
(In this chapter an overline generally denotes reduction modulo N, so for 
example v is any lift of v to Z 2 . Occasionally the overline will denote complex 
conjugation instead, but context will make the distinction clear.) Let S = 
[c v d v ] G SL 2 (Z) with (cv,d v ) a lift of v to Z 2 , and let fc > 3 be an integer. 
Let e N be 1/2 if TV e {1,2} and 1 if N > 2. Define (Exercise 4.2.1) 

El(r) = e N Yl (cr + d)- k = e N ^ j(l,r)- k . (4.4) 

(c,d)=v(N) -ye(P + nr(N))\r(N)8 

gcd(c,(i) — 1 

When N = 1 there is only one choice of v and E% is E/. from before. Note 
that E^ v = (—l) k El. This is a special case of 

Proposition 4.2.1. For any 7 <G SL 2 (Z), 



Proof. Compute for any 7 G SL 2 (Z), using the fact that r(N) is normal 
in SL 2 (Z) at the second and fourth steps, that 
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{Etb]k){r) = e N j( 7 ,r)- k E j(l'n(r))- k 

ye(P+nr(N))\r(N)s 

= E j{ii^y k 

ye(P+nr(N))\sr(N) 

E j(7vr fc 

7"e(p+nr(AT))\(5r(Ar)7 

E J(7",r)- fe = i?r(r). 

7"e(p + nr(jv))\r(jv)5 7 



Corollary 4.2.2. E% G M k (r(N)). 



□ 



Proof. As a subseries of the series E k analyzed in Exercise 1.1.4, E^ is holo- 
morphic on H, meeting condition (1) of Definition 1.2.3. Since every 7 G r(N) 
reduces to the identity matrix modulo N, Proposition 4.2.1 here shows that 
Wy = v and E% is weight-fc invariant with respect to r(N), i.e., it satisfies 
condition (2) in the definition. Finally, the Fourier expansion calculation to 
follow will produce coefficients meeting condition (3') of Proposition 1.2.4, 
showing that £7£ satisfies condition (3) in the definition as well. □ 

With the corollary established, one can symmetrize the Eisenstein series 
E% to create modular forms for any congruence subgroup of level N (Exer- 
cise 4.2.2). 

It is straightforward (Exercise 4.2.3) to compute that 



li , , (±lf if v = ±(0,1), unless k is odd and N G {1,2}, 
hm E%(t) = < 
im(-r)^oo J 0 otherwise. 

When k is odd and N G {1,2}, the Eisenstein space £k(r(N)) has dimension 0 
since — / G r(N). Excluding these cases for the rest of this paragraph, E^ is 
nonvanishing at 00 if v = ±(0, 1) and vanishes at 00 otherwise. Next take any 
v = (c,d) G (Z/7VZ) 2 of order N with corresponding 6 = [ a c b d ] G SL 2 (Z), and 
take any cusp s — a'/c' G Q U {00}, so that a matrix a = [°, G SL2(Z) 
takes 00 to s. The behavior of E% at s is described by the Fourier series of 
El[a]k- By Proposition (4.2.1) 



j?vr r ] rpvot 17,(0, l)Sa 

E k [a\ k - E k - E k 



Thus E^[a]k is nonvanishing at 00 only when (0, l)Sa — ±(0, 1), or (0, 1)5 = 
±(0, l)^- 1 , or [»,'] = ± [~*] (mod N). By Proposition 3.8.3, this is equiva- 
lent to r(N)s = r(N)(—d/c), showing that is nonvanishing at r(N)(-d/c) 
and vanishes at the other cusps of r(N). 
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If k is even or N > 2, pick a set of vectors {v} — {(c,d)} such that 
the quotients —d/c represent the cusps of r(N), cf. Section 3.8. The preced- 
ing paragraph shows that the corresponding set {E%} of Eisenstein series is 
linearly independent. The set contains s 00 (r(N)) elements, so it represents 
a basis of £k(r(N)) according to formula (4.3) since all cusps of r(N) are 
regular. Since £ k (r(N)) — {0} when k is odd and N G {1,2}, we have a 
basis of £k(r(N)) in all cases when fc > 3. The basis elements are cosets 
El + S k (r(N)) since £ k (r(N)) is the quotient space M k (r(N))/S k (r(N)). 
Once £k(r(N)) is redefined as a subspace of Mk(r(N)) in the next chapter, 
the basis elements will be the Eisenstein series themselves. 

The series E k are normalized, but in Chapter 1 it was the nonnormalized 
series Gk whose Fourier series was readily calculable. Analogously, define for 
any point v € (Z/NZ) 2 of order N 

GI(t) = y ' -— -r. 

ky ' ^ (cT + d) k 

(c,d)=v(N) y ' 

Any (c, d) in this sum satisfies gcd(gcd(c, d),N) = 1. Thus 

1 

Gl(r)= E E J^TdY 

n=l (c,d)=v(N) V ' 
gcd(n,AT) = l gcd ( Cid ) = „ 

j/^n -, (c',d')=n- 1 v(N) 



= — E c?(fc)i?r l5 (r) 



cat 

n£(Z/JVZ)* 



where the modified Riemann zeta function (essentially the Hurwitz zeta func- 
tion to be defined in Section 4.7) is 



oo 



m 

m—l 

m=n (N) 

(Note that (Z/1Z)* = {0} since 0=1 (mod 1), i.e., 0 is multiplicatively 
invertible for once. This is consistent with the definition 4>(1) = 1 for the 
Euler totient function. So G is the usual zeta function when N = 1.) Also, 
letting /j, denote the Mobius function from elementary number theory, Mobius 
inversion shows that (Exercise 4.2.4) 

El{r) = e N E C+(k,^G n k - lv (r) (4.6) 

nG(Z/JVZ)* 
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where as before cn is 1/2 if AT e {1,2} and 1 if AT > 2, and where the 
coefficient in the sum is another modified zeta function, 

E ^ »e(Z/JVZ)-. 

m— 1 

m=n (N) 

The set {G£} for the same vectors v as before also represents a basis of 
£k(r(N)). We will reformulate the modified zeta functions in Section 4.4. 
To compute the Fourier expansion of G v k when N > 1, start from 

g U t ) = E E ( CT+( i, + a^ = ivfe E E (CT+dv ,k > ( 4 - 7 ) 

where all congruence conditions in this calculation are modulo N and all sums 
are taken over positive and negative and possibly zero values unless otherwise 
specified. Examining the first sum in (4.7) shows that the constant term of 
G%(t) is (Exercise 4.2.5(a)) 

I 1 if ^ =0 1-E'i (^) 
0 otherwise [ f-f d k 



d=d v 



Next recall equation (1.2), 



1 oo 



for r e "H and k> 2, 

, z^ T + d ) k 

where Cfe = (—2iri) k /(k — 1)1 (this symbol will be used repeatedly for the 
rest of the chapter) and q = e 2T ™ r . If c > 0 in the second sum in (4.7) 
then (ct + d v )/N <G "H and so letting this be the r in (1.2), part of (4.7) is 
(Exercise 4.2.5(b)) 

--,00 ^-y OO 

|EE™ k "Vrc4E E ™ fc -v^, (4.9) 

c=c„ m=l n=l mln 

c>0 n/rn=c v 

m>0 

where /Ltjv = e 27Tl / N and qjv = e 27rlT / N _ Note that the inner sum is zero unless 
n = c„. Similarly, if c < 0 in (4.7) then — (ct + d v )/N e "H and so the rest 
of (4.7) is (Exercise 4.2.5(c)) 

^ OO --,00 

c=c„ m=l n=l mln 

c<0 „/m=c, 

m<0 

(4.10) 

Assembling the results of this calculation gives 
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Theorem 4.2.3. The Fourier expansion ofG%(r) fork>3 andv G (Z/NZ) 2 
a point of order N is 

„ oo 

Gf(r) = 5(c v )^(k) + j± ]T *Li(»)«ft. q N = e 2 ™' N 

71=1 

where 

d=d v (N) 

£ S gn(m)m fe -V^ m . 

n/m=c v (N) 

The sums for ( dv and are taken over positive and negative values of d 

and m. 

Any set {G v k {r)} with one v corresponding to each cusp of T(N) represents 
a basis of£ k {r(N)). 

When N = 1, the Fourier expansion agrees with Gfc(r) from Chapter 1 
for k even and it cancels to zero when k is odd (Exercise 4.2.6). Clearly the 
nth Fourier coefficient is bounded by Cn k , completing the proof that E k is a 
modular form. 

Exercises 

4.2.1. Show that the two sums in (4.4) agree. 

4.2.2. Let N, v, and k be as in the section. Let T be a congruence subgroup 
of SL 2 (Z) of level N, so that T(N) cfc SL 2 (Z). Define 

E t.r= £ E khih, 
7 3 er(Af)\r 

the sum taken over a set of coset representatives. Show that E v k r is well 
defined and lies in Mk(T). 

4.2.3. Establish the formula in the section for Hmi m (T)^oo E k (r). 

4.2.4. (a) The Mobius function fj, : Z+ — > Z is defined by 

/ \ I ( — I) 9 if n = Pi • • • Pa for distinct primes pi, .... p„, 

/*(") = U -f 2 | f • 

10 if p I n for some prime p. 

This includes the case = 1. Show that for n € Z+, 



5(c v ) = 

and 



1 i/ c v = 0, 1 
0 otherwise \ 
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d\n 
d>0 



( 



1 i£n=l, 
0 ifn>l. 



(b) Let v € (Z/iVZ) 2 be a point of order N. It follows from part (a) that 



Rearrange the sum to obtain formula (4.6). (A hint for this exercise is at the 
end of the book.) 

4.2.5. (a) Confirm that the constant term of (4.7) is given by (4.8). 

(b) Show that the terms of (4.7) with c> 0 sum to (4.9). 

(c) Show that the terms of (4.7) with c < 0 sum to (4.10). 

4.2.6. Confirm that when N = 1 in Theorem 4.2.3, the series becomes the 
Fourier expansion of Gfc(r) from Chapter 1 when k is even and cancels to zero 
when k is odd. 

4.3 Dirichlet characters, Gauss sums, and eigenspaces 

Computing the Eisenstein series for the groups A(-W) and ro(N) requires 
some machinery, to be given over the next two sections. For any positive 
integer N, let Gn denote the multiplicative group (Z/iVZ)*, of order (j>(N) 
where <f> is the Euler totient, when this doesn't conflict with the notation 
for Eisenstein series. A Dirichlet character modulo N is a homomorphism of 
multiplicative groups, 



For any two Dirichlet characters \ an d 4> modulo N, the product character 
defined by the rule (xV0( n ) = x( n )' i / ; ( n ) for n € Gn is again a Dirichlet char- 
acter modulo N. In fact, the set of Dirichlet characters modulo N is again a 
multiplicative group, called the dual group of Gn, denoted Gn, whose iden- 
tity element is the trivial character modulo N mapping every element of Gn 
to 1, denoted ljv or just 1 when N is clear. Since Gn is a finite group the 
values taken by any Dirichlet character are complex roots of unity, and so 
the inverse of a Dirichlet character is its complex conjugate, defined by the 
rule x( n ) = x( n ) f° r a U n € Gn- (So here the overline denotes complex 
conjugation, not reduction modulo N .) As explained in the previous section, 
Gi = {0}, and so the only Dirichlet character modulo 1 is the trivial charac- 
ter 1\. 

For any prime p the group G p is cyclic of order p—1. Let g be a generator 
and let fi p -i be a primitive (p — l)st complex root of unity. Then the group 
of Dirichlet characters modulo p is again cyclic of order p—1, generated by 
the character taking g to In general (see, for example, [Ser73]), 




(c,d)=v (N) n|gcd(c,d) 
n>0 




c*. 
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Proposition 4.3.1. Let Gn be the dual group o/Gn- Then Gn is isomorphic 
to Gn . In particular, the number of Dirichlet characters modulo N is <j>(N) . 

The two groups are noncanonically isomorphic, meaning that construct- 
ing an actual isomorphism from Gn to Gn involves arbitrary choices of which 
elements map to which characters. The groups Gn and Gn satisfy the orthog- 
onality relations (Exercise 4.3.1), 

n£G N k XSGjv ^ 

Let N be a positive integer and let d be a positive divisor of N. Every 
Dirichlet character \ modulo d lifts to a Dirichlet character xn modulo N, 
defined by the rule xn(k (mod N)) — \{n (mod d)) for all n e Z relatively 
prime to N. That is, x^v — X ° w N,d where 7Tjv,d : Gn — > Gd is natural 
projection. For example, the Dirichlet character modulo 4 taking 1 to 1 and 
3 to —1 lifts to the Dirichlet character modulo 12 taking 1 and 5 to 1 and 7 
and 11 to —1. Going in the other direction, from modulus N to modulus d, 
isn't always possible. Every Dirichlet character \ modulo N has a conductor, 
the smallest positive divisor d of N such that \ — Xd°^N.d for some character 
Xd modulo d, or, equivalently, such that x is trivial on the normal subgroup 

Kna = ker(7TAT !d ) = {n e Gn : n = 1 (mod d)}. 

For example, the Dirichlet character modulo 12 taking 1 and 7 to 1 and 5 
and 11 to —1 has conductor 3. A Dirichlet character modulo N is primitive 
if its conductor is N. The only character modulo N with conductor 1 is the 
trivial character ljv, and the trivial character ljv modulo N is primitive only 
for N= 1. 

Every Dirichlet character x modulo N extends to a function \ : Z /NTi — > 
C where x( n ) = 0 f or noninvertiblc elements n of the ring Z/NZ, and then 
extends further to a function x '■ Z — > C where (abusing notation) x( n ) = 
x(n (mod N)) for all n G Z. Thus x( n ) = 0 fo r au n such that gcd(n, N) > 1. 
The extended function is no longer a homomorphism, but it still satisfies 
x(nm) = x(n)x(m) for all n, m. In particular, the trivial character modulo iV 
extends to the function 



liv(n) 



1 if gcd(n,iV) = 1, 
0 if gcd(n, N) > 1. 



Thus the extended trivial character is no longer identically 1 unless AT = 1. 
The extension of any Dirichlet character x modulo N satisfies 



X(0) 



1 if iV = 1, 
0 if AT > 1. 
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Summing over n = 0 to TV — 1 in the first orthogonality relation and taking 
n€ Z in the second gives modified versions, 

<t>{N) if x = 1, V ()= UW ifn=l(JV), 
0 if X t^I, ^ XW [0 ifn^l (TV). 

The Gauss sum of a Dirichlct character \ modulo TV is the complex number 
w-i 

9(X) = E = e 2 ™ /Ar . 

n=0 

If x is primitive modulo TV then (Exercise 4.3.2) for any integer m, 

N-l 

E X(n)/4T = X(m)g(x). (4.11) 

It follows that the Gauss sum of a primitive character is nonzero. Indeed, the 
square of its absolute value is 

JV-l N-1N-1 

9(X)9(X)= E x(m)g( X )^ N m = E E X^TvjT by (4.11) (4.12) 

m— 0 m— 0 n— 0 

JV-l AT-1 

= E E "&" 1)m = (4 - 13) 

n— 0 m— 0 

the last equality holding because the inner sum is TV when n = 1 and 0 
otherwise. Formula (4.11) and the following lemma will be used in the next 
section. 

Lemma 4.3.2. Let TV be a positive integer. If TV = 1 or TV = 2 then every 
Dirichlet character \ modulo TV satisfies x(— 1) = 1. If N > 2 t/ien the number 
of Dirichlet characters modulo TV is even, half of them satisfying x(— 1) = 1 
and the other half satisfying x(— 1) = — 1. 

Proof. The result for TV = 1 and TV = 2 is clear. If TV > 2 then 4 | TV or p \ TV 
for some odd prime p. The nontrivial character modulo 4 takes —1 (mod 4) 
to —1, and for every odd prime p the character modulo p taking a generator g 
of G p to a primitive (p — l)st complex root of unity takes —1 (mod p) to —1 
since —1 = g(p -1 )/ 2 (mod p). In either case the character lifts to a character 
modulo TV taking —1 (mod TV) to —1. 

Let Gat denote the group of Dirichlet characters modulo TV. The map 
Gn — > {±1} taking each character \ to x(— 1) is a homomorphism. We have 
just seen that the homomorphism surjects if TV > 2, and so the result follows 
from the First Isomorphism Theorem of group theory. □ 



N-l 



E x( n ) = 



n=0 
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We are interested in Dirichlet characters because they decompose the vec- 
tor space M.k{r\(N)) into a direct sum of subspaces that we can analyze in- 
dependently. For each Dirichlet character \ modulo N define the x- e ig ens P ace 
of Mk{r\{N)), 

M k (N,x) = {/ € M fe (A(A0) : fh\ k = X (dj)f for all 7 e r 0 (N)} . 

(Here d~ ( denotes the lower right entry of 7.) In particular the eigenspace 
M k (N, 1) is M k {r 0 (N)) (Exercise 4.3.3(a)). Also note that M k (N, \) is just 
{0} unless = (-l) fe (Exercise 4.3.3(b)). The vector space Mk^N)) 

decomposes as the direct sum of the eigenspaces (Exercise 4.3.4(a)), 

M k (r 1 (N)) = ®M k (N,x), 

x 

and the same result holds for cusp forms (Exercise 4.3.4(b)), so it holds for 
the quotients as well (Exercise 4.3.4(c)), 

£ k (r 1 (N)) = ®£ k (N, X ). 

x 



Exercises 



4.3.1. Prove the orthogonality relations. (A hint for this exercise is at the end 
of the book.) 

4.3.2. Let x be a primitive Dirichlet character modulo N and let m be an 
integer. Prove formula (4.11) as follows. 

(a) For m = 0, confirm the formula for \ = li an d for nontrivial \. 

(b) Now assume m^O. Let g = gcd(m, N), N — N'g, and m = m'gh for 
the minimal h such that gcd(m', N) = 1. Show that since x( m ')x( m ') = 1j 



jv-i 



iV'-l 



/ 



£x(»Kr = *K) E 



n=0 



n'=0 



N-1 



n=0 
\n=n' (N') 



X{n) 



1 h 



(4.14) 



Let K = K N . N , = {n e (Z/NZ)* : n = 1 (mod N')}, the subgroup 
of (Z/NZ)* introduced in the section. Use the fact that x 1S primitive to 
show that 

if .9=1, 
if g > 1. 



Since (Z/NZ)* = (J , n'K where the coset representatives n' taken modulo N' 
run through (Z/N'Z)*, show that the inner sum in (4.14) is X^nen'if x( n )> 
and that this is x( n ') if <? = 1 and 0 if g > 1. Show that formula (4.11) follows 
in both cases. 
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4.3.3. (a) Show that M k (N, 1) = M k (r 0 (N)). 

(b) Show that M k {N,x) = {0} unless = (-l) fe - 

4.3.4. (a) Show that J Mfe(A(iV)) = 0 M k (N,x)- (A hint for this exercise 

is at the end of the book.) 

(b) Show the same result for the cusp form spaces S k (N,x)- 

(c) Show the same result for the Eisenstein spaces £ k (N, x) ■ 



4.4 Gamma, zeta, and X-functions 

This section gives a brief discussion of three functions from complex analysis. 
Some results are stated without proof since the arguments are written in many 
places, see for example [Kob93]. For any complex number s with positive real 
part, the gamma function of s is defined as an integral, 

f°° rlt 
r(s)= e~H s -, seC, Rc(s)>0. 
Jt=o t 

Since \t s \ = t Rc ( s ) for t > 0, the condition on s ensures that the integral 
converges at t = 0, and since the integrand decays exponentially as t — > oo, 
the integral converges at the other end as well. It is easy to show that -T(l) = 1, 
that r(l/2) — y/ir, and that _T satisfies the functional equation 

r(s + i) = s r( s ) 

(Exercise 4.4.1(a-c)). Consequently -T(n) = (n — 1)1 for n e Z + . (This tidies 
the notation for the ubiquitous constant C' k of this chapter, which is now 
(—2iri) k /r{k).) Rewriting the functional equation as r(s) — r(s+ \)/s defines 
the left side as a meromorphic function of s for s- values with Rc(s) > — 1 and 
thus extends the domain of r one unit leftward. Repeatedly applying the 
functional equation this way extends the domain of r to all of C, and now 
the functional equation holds for all s. For any n£ N, it gives 

v , x T(s + n+l) (-l) n 

lim (s + n)T(s) = lim 



s^-n s(s + 1) • • • (s + n — 1) n\ ' 

showing that the extended F function has a simple pole at each nonposi- 
tive integer with residue Res s= _„_T(s) = (— l)"/n!. We will need the formula 
(Exercise 4.4.1(d)) 

r(s)r(i-s) = -^- (4.15) 

sm(7rs) 

in Section 4.7. This shows that the extended Gamma function has no zeros, 
making its reciprocal l/r(s) entire. We will need the formula (Exercise 4.4.2) 

- k — jt^- = — for even integers k > 2 (4.16) 

7r-*r(f) 2 
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in the next section. 

The Riemann zeta function of the complex variable s is 

oo 1 

71=1 

The sum converges absolutely. An elegant restatement due to Euler of the 
Fundamental Theorem of Arithmetic (positive integers factor uniquely into 
products of primes) is 

where V is the set of primes. Define 

S;( S )=ir- s / 2 r( S /2)C(s), Re(s)>l. 

Section 4.9 will prove that the function £ has a meromorphic continuation to 
the entire s-plane satisfying the functional equation 

£(s) = f (1 - s) for all s E C 

and having simple poles at s = 0 and s = 1 with respective residues — 1 and 1 . 
By the properties of the gamma function, this shows that the Riemann zeta 
function has a meromorphic continuation to the entire s-plane, with one simple 
pole at s = 1 having residue 1 and with simple zeros at s = —2, —4, —6, . . . 
It also shows that £(0) = —1/2 and gives the values of £(— 1), ((— 3), £( — 5), 
... in terms of C(2), C(4), C(6), ■ ■ ■ , which were computed in Exercise 1.1.7(b). 
See Exercise 4.4.3 for an appealing heuristic argument due to Euler in support 
of these values £(1 — k) for even k > 2. 

Every Dirichlct character \ modulo N has an associated Dirichlet In- 
junction similar to the Riemann zeta function, 

L ^X) = Y J ^ L =Y[{1-X{p)p- S r\ Re(*)>l. (4.17) 

Again this extends meromorphically to the s-plane and the extension is entire 
unless x = Ijv, m which case the L- function is essentially the Riemann zeta 
function (Exercise 4.4.4) and again has a simple pole at s = 1. When x(— 1) = 
1, the functional equation satisfied by L(s, \) is 

7r- s / 2 r(§ )N S L( S , x) - K- {l - s)/2 r{^)g{ X )L{l -*,*), (4.18) 

and when x(— 1) = —1 it is 

^-( s +i)/2 r( s±i )tvU(s, X ) = -i7T-( 2 - s )/ 2 r(^). 9 (x)£(l - s, x). (4.19) 

We will not prove these, but see the comment at the end of Section 4.7. 
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The modified zeta functions from Section 4.2, 

OO OO / \ 

<?(*)= E ^. w.m)= E 



m— 1 
m=n (TV) 



m— 1 

m=n (AT) 



m 



n G (Z/NZ)*, 



are expressible in terms of L- functions. For any n € (Z/NZ)* the second 
orthogonality relation shows that summing over Dirichlet characters picks 
out the desired terms of the full zeta function, 

1 °° ^ 1 

ATi E x(n- 1 )L(a,x)=E0(]v) E Xtmn" 1 (mod JV))- 



oo 1 

E ^J = C(-). Re(«)>l- 



m— 1 

m=n (AT) 



Since the sum of L-functions extends meromorphically to the full s-planc 
with a simple pole at s = 1, so does C+( s )- Substituting the sum for C+( s ) m 
relation (4.5) gives 



(4.20) 



with the sum taken over x € (Z/7VZ)* and n e (Z/NZ)*. To invert this rela- 
tion, take an arbitrary linear combination of series G™ " over m G (Z/NZ)* , 



^ a m G™ 



1 



£iV0(iV) 



e^0(iV) 



^ a raX (n)I(fc,x)£r" 5 



m,x,n 



^ a mX (mn)L(fc, X )£f 
E a mX (m)x(n)L(k, X )Ef. 



The left side is the inner product (without complex conjugation) (a, G) where 
a is the vector with entries a m and G is the vector with entries G™ v . The 
right side is the inner product l/eN(aABk,E) where A and B 8 (s e C) are 
the matrices 



X(m) 

(<f>(N)y/* 



( m ,x)eG N xGjv 



x(n)L(s,x) 
(4>(N)y/* 



(X,«)6G JV XG JV 



and E is the vector with entries E™ . By the second orthogonality relation, 
A* A = I where A* is the adjoint (transpose-conjugate), and so A is invertiblc. 
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By the first orthogonality relation, B S B* (where the adjoint operator conju- 
gates complex scalars but not the variable s) is the diagonal matrix with 
diagonal entries L(s,x)L(s,x), and so B s is invertible as a matrix of mero- 
morphic functions on C. The product formula (4.17) shows that L(s,x) ^ 0 
when Re(s) > 1, and so the meromorphic functions in the inverse matrix have 
no poles when Re(s) > 1, in particular when s is an integer k > 3. Letting e\ 
denote the first standard basis vector and choosing a(k) — e 1 {ABk)^ 1 gives 

if = £A r5> m (fc)G^ 1,J . (4.21) 

m 

Comparing this to (4.6) shows that (+(k,[i) — a n -i(k) for integers k > 3, and 
in fact the argument establishes the same relation replacing k by any complex 
s with Re(s) > 2. Thus C+( s > /•*) is also a meromorphic function that continues 
to the full s-plane. 

The meromorphic continuation of £™ (s) quickly gives an analytic continu- 
ation of the function C"( s ) = Tl'm=n(N) l/ mS occurring in the constant term 
of GJ^(t), cf. Theorem 4.2.3 (Exercise 4.4.5(c)). We will need the formula 
(Exercise 4.4.5(d)) 

C " (1) = ^ + ^ COt ^)' &=d(n,N) = l. (4.22) 

in Section 4.8. 



Exercises 

4.4.1. (a) Show that T(l) = 1. 

(b) Show that 1^(1/2) = y^. (Hints for this exercise are at the end of the 
book.) 

(c) Show that r(s + 1) = sr(s) when Rc(s) > 1. 

(d) For any positive integer n consider the integral 

Thus limn^oo I n (s) = r(s). Change variables to get 

I n (s) = n s ( (l-t) n t s - 1 dt, 
Jt=o 

integrate by parts to show that I n (s) — (l/s)(n/(n — l)) s+1 / n _i(s + 1), and 
evaluate Ii(s + n— 1) to conclude that 

n s 

* 11^=1(1 + «/"») 
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Therefore 



I n {s)I n {-s) = 



-* 2 n;u(i-s7™ 2 r 

Letting n — > oo, use the formula sin(7rs) = 7rs IIm=i(l ~ s 2 / to2 ) and the iden- 
tity — s_T(— s) = 7^(1 — s) to prove (4.15). If you have the relevant background, 
make this heuristic argument rigorous by citing appropriate convergence the- 
orems for integrals and infinite products. 

4.4.2. Prove formula (4.16). (A hint for this exercise is at the end of the book.) 

4.4.3. This exercise presents an argument due to Eulcr for the functional 
equation of C( s )- 

(a) Let t be a formal variable. Starting from the identity 

t + t 2 + t 3 + t 4 + ■ ■ ■ = (t - t 2 + t 3 - t 4 + ■ ■ ■ ) + 2{t 2 + t 4 + t 6 + t 8 + ■■■), 

show that applying the operator t^ t (i.e., differentiation and then multiplica- 
tion by t) n times gives 



t^] ) + 2 n+1 (l n t 2 + 2 n t 4 + 3 n t e + 4 n t* + ■■■)■ 



l n t + 2 n t 2 + Tt 3 + A n t 

dt) \l + t 

Formally, when t = 1 this is £(— n) = (i^)" (l+t) 2" +1 C(~ n ) ; giving a 
heuristic value for £(— n). Thus for example, according to Eulcr, 1 + 1 + 1 + 
1 + •• • = -1/2 andl + 2 + 3 + 4+--- = -1/12. 

i d d 

'dt ~ dX' 



(b) Let t = e x and note that t4z = -r£. Now we have 



(l-2" +i )C(-n) 



giving the Taylor series 



e x 



dX n V e x + 1 



for n e N, 



= (l-2"+ 1 )C(-n) ^„ 
e x + l ^ n\ 

n=0 

Thus the function of a complex variable 

= e 2,« = ~ (l-2"+ 1 )C(-n)(27ri)" 
1 ' e 27riz + 1 ^ rc! 

generates (in the sense of generating function, cf. Section 1.2) the values C(~ n ) 
for all natural numbers n. 

Let G(z) = TTCotirz. Recall from equations (1.1) that 
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0 2niz 



P 2mz 4-1 1 ^2, 



fc=i 

The rational forms of F and G look similar, suggesting a relation between 
C(l — k) and Q{k) for even k>2. Use the rational forms of F and G to show 
that in fact 

-(G(z) - 2G(2z)) - -F(z) + F(-z), 
and then equate coefficients to obtain 



2r(fc) 

(27ri)* 



C(! - fc ) = 7S3n£C(*0 for even k > 2. 



The value of £(fc) computed in Exercise 1.1.7 shows that £(1 — k) — —Bk/k 
for even k > 2 where is the Bernoulli number, 
(c) Use formula (4.16) to show that 

^- fc/2 r(f)C(fc) = ir- {1 - k)/2 r{^±)({l - k) for even k > 2, 

giving a partial version of the functional equation. 

The ideas here can be turned into a rigorous proof of the meromorphic 
continuation and the functional equation of (, cf. [Hid93]. 

4.4.4. Show that L(a, 1 N ) - CM U p \nO- ~ P~ S )- 

4.4.5. Take the nonpositivc imaginary axis as a branch cut in the z-plane, so 
that arg(z) G (— tt/2, 37r/2) on the remaining set and the function log(z) = 
In \z\ + iarg(z) is single-valued and analytic there. For any z off the branch 
cut define 

z s = e sXo ^ z \ seC. 

(a) Show that (— r) s = (— l) s r s for any positive real number r and any 
s e C, even though the rule (zw) s — z s w s does not hold in general. 

(b) Show that for any n e Gat, 

C(s) = c?W + (-i)- s C"(s), Ms) > i- 

(c) It follows that 

C " (S) = t(Nj 5 (x(n ~ 1} + (-^"^((-n)- 1 ))^^, X), Re( S ) > 1. 

This continues meromorphically to the full s-planc with the only possible pole 
coming from the term where \ = ljv- Show that 

lim ^(^(n- 1 ) + {-\)-l N {{-n)- l ))L{8, 1 N ) = ^. 
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Thus the continuation of ( n (s) is entire. (Hints for this exercise are at the end 
of the book.) 

(d) For x 7^ Ijv, the analytic continuation of L(s, x) is bounded at s = 1. 
Show that for TV > 1, 

C " (1) = £ + Jn) E (x(n ^ } - x((-n)- 1 ))i(l, X) 

4.4.6. Substitute u 2 = t in the definition of r to get 

r°° , ,/,, 
r{s) = 2 e- u u 2s — , s e C, Re(s) > 0. 

Ju=0 u 

Show that therefore for a, b > 0, 



r( a )r(b) = r(a + b) 



Jx=0 



This last integral is a _Beto integral. Show also that for any positive integer to, 

1 f 1 



L 



I x™ 1 (l-x) *dx. 

Jx=0 



t=o VI -t m m J x=0 

Use these results to explain the last equalities in the formulas for -074 and -073 
at the end of Section 1.1. (A hint for this exercise is at the end of the book.) 



4.5 Eisenstein series for the eigenspaces when k > 3 

The Eisenstein series G" k from Section 4.2 are linear combinations of the series 
E k , so the transformation rule of Proposition 4.2.1 applies to them as well, 
G v k [i\k = G^ for all 7 <= SL 2 (Z). Note that vectors modulo N of the form 
v = (0, d) satisfy 

(0,d) 7 = (0,dd 7 ) for all 7 e r 0 (N), 

where d 7 is the lower right entry of 7. Thus, symmetrizing over d gives a sum 
of Eisenstein scries 

E 4 M) 

dG(Z/ATZ)* 

lying in A4fc(-To(A^)) (Exercise 4.5.1(a)). Similarly, if x is a Dirichlct character 
modulo N then the sum 



E x(rf)4 M) 



d£(Z/NZ)* 
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(here \ is the complex conjugate of \) ncs m -Mk(N,x) (Exercise 4.5.1(b)). 
This section generalizes these ideas to construct bases of the spaces £k{N, x)- 

For any two Dirichlet characters ip modulo u and ip modulo v such that 
uv = N and (i/></?)(— 1) = (~ l) fe (here the characters are raised to level uv so 
their product makes sense) and ip is primitive, consider a linear combination 
of the Eisenstein series for r(N), 

u— lv— 1 u— 1 



Gt v (r) = EEE^( d ) G i cM+CT) (-)- 

c=0 d=0 e=0 

Let 7 be an element of r a (N) and consider the transformed series Hfc, 

c=0 d=0 e=0 

Writing 7 = " 7 d 7 € r 0 (N), it is straightforward to compute (Exer- 



cise 4.5.1(c)) that (cv, d + ev)j — {civ, dl + e'v), where c', e' e {0, . . . , u — 1} 
and d 1 € {0, — 1} are defined by the conditions d = ca^ (mod u) 
and d' = dd 7 (mod w) and e' = (e + c'6 7 )c? 7 (mod u). Also, ip(c)(p(d) = 
{il)tp){d 1 )il)(c!)(p(d'). It follows that 

and eMk(N,il><p). 

Now we compute the Fourier expansion of Gf'^. Since the nonconstant 
part of G% (t) rewrites as 

^ 2 sgn(m)m fe - 1 ^ m 9r, 

mn>0 
n=c v (N) 

the nonconstant part of G^' v {t) is (remembering that v denotes a vector in 
the previous expression and denotes the conductor of <p in the next one) 

pEE&W^) E sgn(m)m fe -Vr CT)m 9r 

c=0 d=0 e=0 mn>0 
n=cv (N) 

, y u— lv— 1 / u—1 \ 

= ^IEE^ C )^) E sgn(m)m fe -Vl ro E< W 

c=0 d=0 mn>0 \e=0 / 

n=cv (N) 

The inner sum is 0 unless w | m, in which case it is u. The third sum is empty 
unless v \ n. So, replacing m by am and n by vn and letting /x„ = e 27Tl ^ v , this 
is 
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« u—l v—1 

^EE^ c M d ) E sgn(m)m fe - 1 ^ m ? m " 

c=0 d=0 mn>0 
n=c (u) 

= §rX>( c ) E sgn(m)m fe - 1 (£^(d)^'"U'"". 

c=0 mn>0 \d=0 / 

n=c (u) 

The inner sum is ip(m)g((p) by formula (4.11). (This is where the calculation 
requires <p to be primitive. Note also that since ip is primitive the Gauss sum 
is nonzero.) So now the sum is 

Cfe ^I>( c ) E sgn(™Mm)m fe -y- 



IT 

c— 0 mn>0 
n=c (u) 



ip(n)sgn(m)tp(m)m k 1 q mn . 



mn>0 



Using the fact that (i()ip)(— 1) = (— l) fc and absorbing one more constant shows 
that the nonconstant part of Gf' v (r) is 



2 -°4^ E ^w^v 



m,n>0 



= C ^ 2 E E ^w™)^™)™'^ 

ra=l 1 m|n 

\m>0 / 



Meanwhile, the constant part of G^' v (t) is 

it— Id— lit— 1 v — 1 u — l 



E E E ^(dw^c*^*) - m E E ^)c d+eu (fc)- 

c=0 d=0 e=0 d=0 e=0 

Since ip(0) is 0 unless u = 1, we may take u = 1 in continuing, and the 
constant term is 

^( 0 )£^)c 5 (fc) = ^(o)E^) E ^ = #)E? 

d=0 d=0 m=d(v) m^O 

= 2^(0)L(fc,$>), 

where for the last step, 1) = (— \) k when t/j is trivial and both sides 
are 0 otherwise. When k is even, the functional equation (4.18) with v and 
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(p in place of N and \ an d formula (4.16) show that the constant term is 
(Exercise 4.5.2(a)) 

When k is odd, the functional equation (4.19) and formula (4.16) show the 
same relation (Exercise 4.5.2(b)). So finally, 

Theorem 4.5.1. The Eisenstein series G^' v takes the form 
where E^ ,Lp has Fourier expansion 



2nir 



71=1 

-ffere <5(-0) is 1 if ip = li and is 0 otherwise, and the generalized power sura 
in the Fourier coefficient is 



m k -\ 



m\n 
m>0 

For any positive integer N and any integer k > 3, let Ajv,fc be the set of 
triples (ip, ip, t) such that ip and ip are primitive Dirichlet characters modulo u 
and v with (ipcp)(— 1) = (— l) fc , and t is a positive integer such that tuv \ N . 
Then (Exercise 4.5.3) \A Njk \ = dim(£ fc (ri(iV))). For any triple (ip,<p,t) £ 
Ajvfc define 

^■*(r)=^(tr). 

Now that iV no longer necessarily equals uv, we need to recognize that our 
calculations actually showed that E^'^ £ M. k (r\(uv)) and all the congru- 
ences were taken modulo uv. By the observation at the end of Section 1.2, 
it follows that Ef'^'* g M. k (r\(tuv)) , and since tuv \ N it follows that 
Et'^ £ M k (r,(N)). 

Theorem 4.5.2. Let N be a positive integer and let k > 3. The set 

{Et'^ :W>,<p,t)eA Ntk } 
represents a basis of £ k (ri(N)). For any character \ modulo N, the set 

represents a basis of £ k (N, \) ■ 
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As a special case, when x = 1» we have a basis of £k(r 0 (N)). We skip 
the details of proving this theorem and its pending analogs for k = 2 and 
k = 1. Again the actual basis elements are cosets for now, but once £ k (r(N)) 
is redefined in the next chapter as a subspace of Mk(r(N)) the Eisenstein 
series themselves will make up the basis. 

The Eisenstein series of Theorem 4.5.1 will be revisited at the end of the 
book in a context encompassing the Modularity Theorem. 

Exercises 

4.5.1. (a) Check that J2de(z/NZ)* E^'^ is an element of M k (r 0 (N)). 

(b) Check that Ede(z/Nzy x(d)E { ^ d) £ M k (N, X ). 

(c) Verify the calculations showing that Gf'^ £ A4 k (N,iptp). 

4.5.2. (a) Show that when k is even, the constant term of Gf' v (r) is 
(C k g(v)/v k )iP(0)L(l-k,tp). 

(b) Show that when k is odd, the constant term of G^' v (t) is given by the 
same formula. 

4.5.3. Let A N ^ k be the set of triples (ip,tp,t) defined in the section and let 
Bn,u be the set of pairs (ip' , <p') such that ip' and tp' are Dirichlct characters 
modulo u' and v' , not necessarily primitive, with (ip'(p')(— 1) = (— 1) and 
u'v' = N. For such a pair, let u be the conductor of ip' , let ip be the corre- 
sponding primitive character modulo u, and similarly for v and tp. Conversely, 
for a primitive character ip modulo u and u' a multiple of u, let ip' u , be the 
corresponding character modulo u' , and similarly for ip' v , 

(a) Show that the map from An^ to Bm.r taking (ip, tp, t) to (ip' tu , <^' N /u u \) 
and the map from Bm,u to An^ taking (ip',p') to (ip,p,u'/u) invert each 
other. Thus |Aw,fc| = |-Bjv,fc|- 

(b) Show that the number of pairs (ip' , tp') satisfying the membership con- 
ditions for B]y k except the parity condition on ipip is X)d|7V 4>(d)4>(N/d). 

(c) Use Lemma 4.3.2 to show that \B Ntk \ = dim(f fc (r 1 (7V))). (A hint for 
this exercise is at the end of the book.) 

4.6 Eisenstein series of weight 2 

This section describes £2(B(N)), £ 2 (r 1 (N)), and the eigenspaces £2(N,\), 
including £ 2 (r Q (N)). Recall from Section 1.5 that for any v £ (Z/NZ) 2 of 
order TV the function 

t vf n 1 ( c vT +d v \ 
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is weakly modular of weight 2 with respect to r(N). The methods of Sec- 
tion 4.2 give the Fourier series, so briefly, letting 0 < c v < N for convenience, 
the leading term and the terms for c = 0 sum to 



-y' 



a 

1 ^2 ™ IN N 



5(c,)C d "(2) + (1 - <5(c,))-§ VN m lN m ~ ^2 2 C(2), 



and the terms with c > 0 sum to 



EE 



]ylE( E sgn(m)m^ ro )^-^E<^"' 

n=l m|n «=1 



m<0 

and the terms with c < 0 sum to 



^yy 1 

N 

OO OO 



1 

c<0 d£Z \ { ~ 'jv ' 

OO 

X! sgn(m)m^ m )^-^^a(n)^ 



jy2 Z^Z^ \ / c „r+d„ cr ^2 ( cr + d )2 



<^2 \ ~* / \ ~> , s « ^2 

TV 2 " 

ra=l m|n n=l 
m>0 

oo 



-(!-^)^E^ m ^ m - 

m— 1 

In these calculations, C*2 = (— 27T2) 2 /_T(2) and <r(n) = X!o<rf|n ^- total, 
/J( r ) = ^IC 7 ") — G 2 (t)/N 2 where 

„ OO 

Gf(r) = 5(^(2) + ^ 9* = e 2 "^, (4.23) 

the terms having the same meaning as in Theorem 4.2.3, and Gi(j) is the 
weight 2 Eisenstein series from Section 1.2, 

OO 

G 2 (r) - 2C(2) + 2G 2 ]T <r(n)g n , q - e 2 " T . 

n=l 

Recall from Section 1.2 that the function G 2 (t) — 7r/Im(r) is weight-2 invariant 
with respect to SL 2 (Z). So instead of f% , the function 
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9l{r) = Gl{r) - N2lm{r y ve(Z/NZ) 2 of order TV (4.24) 

is weight-2 invariant with respect to r(N). Thus g 2 is the series of Theo- 
rem 4.2.3 extended to k — 2 except that now it also includes a nonholomor- 
phic correction term. The Fourier coefficients of g 2 (t) are bounded by Cn 2 
(Exercise 4.6.2). 

The formula dim(£ 2 (-0) = £ co — 1 in (4.3) at the beginning of this chapter 
shows that as v runs through a set of cusp representatives for r(N), the set 
{g 2 } has one too many elements to represent a basis. The set of differences 

{.92 1 -92^92 -92 \ ■ ■ ■■,92^ -ff2 e }> where e = and vi, . . . , v £ represent 
the cusps of r(N), does represent a basis though we omit the proof. These 
differences are modular forms since they are holomorphic and weakly modular 
and their Fourier coefficients are small enough. Thus £ 2 (r(N)) is the set of 
linear combinations of g 2 whose coefficients sum to 0, canceling the occur- 
rences of the correction so that we can use the G 2 instead, subject to the 
most symmetrical constraint, 

Theorem 4.6.1. 



\ V V ) 



where the sums are taken over vectors v of order N in (Z/7VZ) 2 . 

Moving on to £ 2 (A(-W)) and its eigenspaces, let and p be Dirichlet 
characters modulo u and v (so v is no longer a vector) with uv — N and 
(i/)ip)(—l) = 1 and p primitive, and consider the sums 

u— 1 v— 1 u— 1 



Gf v (r) = E E E ^)p(d)G^ d+ ^(r), 

c=0 d=0 e=0 

oo 

Ep v (r) = 5W)L(-1, ip) + 2 E 4'*{n)q n , q = e 2 ^ . 



n=l 



Here G 2 cv ' d+ev \T) is from (4.23), and 5(ip) and af' v (n) have the same mean- 
ings as in Theorem 4.5.1. If cither ip or <p is nontrivial then the coefficients 
sum to 0 in G 2 ' v (t), and as in Section 4.5 

Gp v e M 2 (N,^), Gp v (r) = ^M^^( r ). 

When ip and ip are both trivial no sum G ! 2 1 i„,i t , (r) is a modular form. But for 
any positive integer t (Exercise 4.6.3), 

G 2 ,i 1 ,i 1 (t) - tG 2MM {tr) = ^(E 2MM {t) - tE 2MM {tr)) 

\ (4-25) 
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where G 2j t <S M 2 (Fo(t)) is the modular form from Section 1.2. 

Let .Ajv,2 be the set of triples (ip,ip,t) such that "0 an d are primitive 
Dirichlet characters modulo u and v with (ip(p)(— 1) = 1, and f is an integer 
such that 1 < tuv \ N. Note that the triple (li, li, 1) is excluded from Am,i- 
Checking formula (4.3) shows that this makes |^4jv.2 | = dim(£ 2 (r\(N))) . For 
any triple (ip,ip,t) e Am,2 define 



represents a basis of £ 2 (ri(N)). For any character \ modulo N , the set 



represents a basis of £ 2 {N,x)- 
Exercises 

4.6.1. Confirm the Fourier expansion of f 2 ■ 

4.6.2. Show that the nth Fourier coefficient of g 2 (r) is bounded by Cn 2 for 
some constant C. 

4.6.3. Prove equations (4.25). 

4.7 Bernoulli numbers and the Hurwitz zeta function 

Calculating the weight 1 Eisenstein series requires a generalization of the 
Bernoulli numbers introduced in Exercise 1.1.7. The Bernoulli numbers arise 
naturally in the context of computing the power sums 




li- 



Theorem 4.6.2. Let N be a positive integer. The set 



{^:(^, ft i)€4,2} 



{Ep^ : (1>,<p,t)€A Ni2 , ^ = X } 




o 



n, 



l 1 +2 1 + --- + n 1 



-(n 2 + n), 
l(2n 3 + 3n 2 + n) 



l 2 + 2 2 + --- + n 2 



etc. 



To study these, let n be a positive integer and let the fcth power sum up to 
n — 1 be 
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n-1 

S k (n) = ™> k , k e N. 

m=0 

Thus So(n) — n while for k > 0 the term 0 fc is 0. (Having the sum start 
at 0 and stop at n — 1 ncatens the ensuing calculation.) The power series 
having these sums as its coefficients is their generating function, an idea we 
encountered back in Section 1.2, 

OO f. 

S(n,t) = J2S k (n)-. 
k=o K - 

Rearranging the double sum gives (Exercise 4.7.1(a)) 

e nt _ 1 f 

S(n,t)=— — - r -. (4.26) 
t e l — 1 

The second term is independent of n. Its coefficients are the Bernoulli numbers 
by definition, constants that can be computed once and for all, 

t °° t k 

^\ = ^ Bk kV 

k=0 

Now the generating function rearranges to (Exercise 4.7.1(b)) 

1=1 3=0 J - 




(4.27) 



Thus, defining the fcth Bernoulli polynomial as 

3=0 V/ 

which again can be computed once and for all, comparing the first and last 
expressions for S(n,t) shows that the fcth power sum is 

S k (n) = ^-j-j-(5 fe+ i(n) - Bfc+i). 

As in Exercise 1.1.7, B 0 = 1, Bi = -1/2, and B 2 = 1/6, so the first few 
Bernoulli polynomials are B 0 (X) = 1 and 

B 1 (X)=X- 1 -, B 2 (X) = X 2 -X+ 1 -, B 3 (X)=X 3 - 3 -X 2 + 1 -X. 
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For example, l 2 + 2 2 H h n 2 = S 2 {n + 1) = (B 3 (n + 1) - £3) /3 works out 

to (2n 3 + 3n 2 + n) /6 as it should. 

To define more general Bernoulli numbers, note that the Bernoulli poly- 
nomials also have a generating function (Exercise 4.7.2(a)), 



te^ 
- 1 ~ 

fe=0 



Let u be a positive integer and let V : Z/mZ — ► C be any function, not 
necessarily a Dirichlet character. The Bernoulli numbers of are determined 
by a generating function, 

E^ c )^n = E^4- (4 - 29) 

c=0 fc=0 

In particular, Sfe.ii = -Bfc- Substituting the generating function of the 
Bernoulli polynomials in the left side of this relation shows that (Exer- 
cise 4.7.2(b)) 

u-l 



B k ^ = u*- 1 J2 ^(c)B k (c/u). (4.30) 

c=0 

In the next section we need (4.30) specialized to k = 1, 

5>( c ) L-9 =Bi ^ (4 - 31) 



c=0 



The Hurwitz zeta function is 
00 ^ 

C(s, r) = V 7 , 0 < r < 1, Re(a) > 1. 

">v > ( r + n \s - w 

n— 0 

In particular, £(s, 1) is the Riemann zeta function, and if r takes the form 
r = d/N with 0 < d < N (both integers) then ((s, r) = iV s C+(s) where C+(s) 

is the modified zeta function of Sections 4.2 and 4.4. 

e -rt 00 

Let f r (t) = — = ^2 e -( r + n ) t for t > 0 and consider the integral 

6 n=0 

/■°° dt 
9r(s) = / 

Since / r (£) grows like 1/t as i — > 0 + the integral converges at its left end 
when Re(s) > 1, and since f r (t) decays exponentially as t — > 00 the integral 
converges at its right end for all values of s. The rapidly converging sum 
passes through the integral (Exercise 4.7.3), and since dt/t and the limits of 
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integration are invariant when t is replaced by t/(r + n), the integral works 
out to (Exercise 4.7.4) 

g r (s) = r(a)C(a,r), Re(s) > 1. 

This integral representation of a well understood factor times the zeta function 
is a Mellin transform, an idea that will be developed further in Section 4.9. 
To connect the Hurwitz zeta function with Bernoulli numbers, consider the 
function 

fr{t) = -tfr(-t) = , t < 0. 

e l — 1 

If tp is a Dirichlet character modulo u then f c / u (ut)/u is the summand in the 
generating function (4.29) for {B^^}- Returning to a general value of r, since 
dt/t is negated when t is replaced by —t and since f r {—t) = f r (t)/(—t) for 
t > 0, the Mellin transform is also 

f° ~ dt 
9r(s)= Mt){-t)'T> Re (*)>l. ( 4 - 32 ) 

Jt=-oo 1 

Similarly to Exercise 4.4.5, take the nonpositive real axis as a branch cut in 
the z-plane so that arg(z) g (— 7r,7r) on the remaining set and the function 
log(z) = In \z\ + i arg(z) is single- valued and analytic there. For any z off the 
branch cut and any s g C define z s = e si ° 6 ( z \ For any positive real number 
e, let 7 e be the complex contour traversing the "underside" of the branch cut 
(where arg(z) = — tt) from — oo to —e, and then a counterclockwise circle of 
radius e about the origin, and finally the "upperside" of the branch cut (where 
arg(z) = 7r) from — e to — oo. Consider the complex contour integral 

This integral converges, and the rapid decay of the integrand as z — > — oo 
shows that the integral defines an entire function of s. Comparing the z s_1 
in the integrand against the (— t) s_1 in (4.32) as z traverses the negative 
axis back and forth shows that the integral is — 2i sm(Tr s)r(s)((s, r) when 
Re(s) > 1 (Exercise 4.7.5). Thus the function 

2tsm(Trs)r(s) J 7e z 

is a mcromorphic continuation of ((s, r) to the full s-plane. By formula (4.15) 
the continuation is 

a/ \ ^(1 ~ s ) f ; , \ s-idz „ 
CM = -A 27rf 1 Jjr(z)z° 1 -, sec. 

This lets us evaluate L-functions at nonnegative integers. Let s = 1 — k for 
a positive integer k and let ip ^ 1\ be a Dirichlet character modulo u. Then 
on the one hand, 
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V>(c)C(l - k, c/u) = u l - k J2 1>(c)$.(l - k ) = u l - k L{\ - k, # 



c=l 

But on the other hand, 



c=l 



jr^c)((l-k,cM = -f2i>(c) I ^l [ fc/u(z) 
c=l c=l T e 



"le c=l 



The integrals along the two straight segments cancel. Replace z by uz and 
apply Cauchy's integral formula to the remaining integral around the circle 
to get 



Ewc(i-m/«) 



c=l 



2m 



7» \c=l 

fc/u(uz) 



yk+1 



\c=l 



— u 



1-fc 



2=0 



Therefore L(l — k,ip) = —Bk.^/k for k > 1 just as we saw that ((1 — fc) 
—B k /k for fc > 2 earlier. We need the result for k = 1 in the next section, 



= -L{0,i/>) if ^ ^ 1 1 - 



(4.33) 



For more on the topics of this section see [Lan76], Chapters 13 and 14. In 
particular, the functional equations (4.18) and (4.19) for L-functions follow 
quickly. These are also presented as a series of exercises in [Kob93] . 



Exercises 

4.7.1. (a) Establish formula (4.26). 
(b) Establish formula (4.27). 

4.7.2. (a) Establish formula (4.28). 
(b) Establish formula (4.30). 

4.7.3. Let 0 < r < 1 and let Re(s) > 1. Show that the sum J2n=o e -(r+n)t t s-i 
converges absolutely. Show that the sum J2 n f~ e~^ r+n ^ t t s ~ 1 dt converges. 
Show that the integral of the sum J ^ n e -( r + n ) t t s ~ 1 dt is equal to the sum 
of the integrals ^ n J t e~( r+ ™)*t s_1 cii. (A hint for this exercise is at the end of 
the book.) 



4.7.4. Compute that g r (s) = r(s)((s,r) when Re(s) > 1. 
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4.7.5. (a) Show that the contour integral in the section is independent of e 
when 0 < e < 1. 

(b) For Re(s) > 1, show that as e — > 0 + the sum of the two linear pieces 
of the contour integral in the section converges to — 2i sm(n s)r(s)((s, r). 

(c) For Re(s) > 1, show that as e — > 0 + the circular piece of the contour 
integral converges to 0. (A hint for this exercise is at the end of the book.) 

4.8 Eisenstein series of weight 1 

Just as the Weierstrass p-function defines weight 2 Eisenstein series, another 
function associated with a lattice A leads naturally to series of weight 1 . The 
natural starting point is the Weierstrass a-function 



Since this function has simple zeros at the 2-dimensional lattice A c C just 
as sin irx has simple zeros at the 1-dimcnsional lattice Z c R, it is named "cr" 
by analogy. The exponential factors are needed to make the infinite product 
converge (Exercise 4.8.1(a)). 

The logarithmic derivative a' /a (Exercise 4.8.1(b)) is the Weierstrass zeta 
function, denoted Z here to avoid confusion with the Ricmann zeta, 



This function has simple poles with residue 1 at the lattice points, analogously 
to the logarithmic derivative tt cot nx of sin irx, but it isn't quite periodic with 
respect to A. Instead, since Z' A = —p A is periodic, if A = wiZ®u 2 Z then the 
quantities 

rn(A) = Z A (z + Wi) - Z A (z) and rj 2 (A) = Z A (z + u 2 ) - Z A (z) 

are lattice constants such that 

Z A (z + mu>i + n 2 uj 2 ) = Z A (z) + nir/i(A) + n 2 r/ 2 (A), ni,n 2 G Z. 

Under the normalizing convention uji/uj 2 G % the lattice constants satisfy 
the Legendre relation rj 2 {A)LOi — r)i(A)uj 2 = 2ni (Exercise 4.8.2). The second 
lattice constant appears in the g-product expansion of a specialized to A = A T 
(proved in [Lan73], Chapter 18), 





1 



OO 



(1 - e 27Tiz q n ){\ - e 
(1 - q n ) 2 





where as always, q — e 27TlT . The logarithmic derivative is therefore 
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ZaM =V2(A t )z- m i _ e27itz - 2m f 1 _ ^ ~ 1 _ e _ 2ff » n j ■ 

n=l ^ ' 

[Lan73] also shows that r/ 2 (A T ) = G2(t) (see Exercise 4.8.3), and so the Leg- 
endre relation shows that f]i(A T ) = tG2(t) — 2iri. 

For any vector v G (Z/NZ) 2 of order N, the function of modular points 
Ff(C/A, {u x /N + A, uj 2 /N + A)) 

,'cyOJi + d v u> 2 \ c v t]i(A) + d v i] 2 (A) 

— 6 A 



N ) N 

is well defined (Exercise 4.8.4(a)) and degree-1 homogeneous with respect 
to r(N) (Exercise 4.8.4(b)). The corresponding function (divided by N for 
convenience) 

1 fc v r + d v \ c v rji(A T ) + d v rj2(A T ) 



V, N 1 ry ( C v T+d v \ 



iV 2 



is weakly modular of weight 1 with respect to r(N). Letting z — (c v r + d v )/N 
in the last expression for Za t and taking 0 < c v < N for convenience, a 
calculation similar to the weight 2 case now shows that the terms of g\ (t) are 
(Exercise 4.8.5) 

m(A T )z _ c v r)i{A T ) + d v ri 2 (A T ) _ 2-kic v 
N N 2 ~ N 2 

and 

m 1 + e 27 " z 
N 1 - e 27 " z 

t./_ x 7f , /Trd v -. j TTl C\ v-^ 

5{cv)^cot ( — ) + (1 - ^c)) I -— + — 2^ M f/x 



m — 1 



and 



iV ' 1 — e 27Tiz q n 

n=l 



^ ]T( £ sgn(m)/# m )g& - (1 - S(c v )^ 

n=l m\n m=l 



m>0 



and 



9-tt-i °° „-27riz„n A 00 

f E r^?^ = § E( E 



m<0 
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Assembling the results gives 

gl{r) = G?(r) - § (| - 0 , 0 < c„ < JV, (4.34) 

where by using formula (4.22) and defining cr^n) as in Theorem 4.2.3, 

„ oo 

G\(t) = 6(c v )( d "(l) + <^n- ( 4 - 35 ) 

n=l 

Thus the series G\ is again analogous to G v k for fc > 3. Since (t) is holomor- 
phic and weakly modular with respect to r(N) and its nth Fourier coefficient 
grows as Cn, it is a weight 1 modular form with respect to r(N). 

Note that G ( ^^\t) = -G ( ^^\t) when c v ^ 0, and Gf'~ dv \r) = 

— Gi dv \r) (Exercise 4.8.6), consistently with the dimension of £i(r(N)) 
being Eoo/2 in formula (4.3) rather than 8^. 

Let ip and f be Dirichlet characters modulo u and v (again v is no longer a 
vector) with uv = N and ip primitive and (tpip)(— 1) = —1. As in Sections 4.5 
and 4.6, consider the sums 

u—lv—lu—l 



Gt - v (r) = e £ £ ^M^r^V), 

c=0 d=0 e=0 

oo 

= j( V )L(o, v) + mm <p)+zJ2 a o' ip ( n )i n ■ 

n=l 

Here d(ip) and ^'^(n) have the same meanings as in Theorem 4.5.1. Part 
of the constant term of Gf'^^r) comes from summing the correction term 
from (4.34), 

U— 1 y \ V— 1 It— 1 



iV ^-^ V iV 2, 



This is 0 unless <p = li, in which case u — N and V> is nontrivial and /u = 
1, and so it works out to 



c=0 



^%)£(0,^) by (4.33). 



As in the calculation for Theorem 4.5.1, the rest of the constant term 
of Gf v (r) is 

u— 1 v— 1 u— 1 V— 1 

E E E ^(cM^mc^i) = v>(o) E ^K'a) 

c=0 d=0 e=0 d=0 
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where in the second step we may assume tp = li. This makes tp(— 1) = — 1 
and the sum is the analytic continuation as s — >• 1+ of 

J2?(dK d (s) = Yl £' mn- s = (i-(-i)- s )L( S ^). 

d=0 d n=d(v) 

So the rest of the constant term is 25(tp)L(\, tp), and as in Section 4.5 this is 
(C\g(tp)/v)5(tp)L(Q, tp). The constant term thus totals 

^M(^)L(0,V)+^)L(0,^)). 
The rest of the calculations in Section 4.5 go as before, giving 

Gf' v G Mi(N, iP<p), Gi' v {r) = °±9M E f .*>( T ). 

w 

Let Aat.i be the set of triples (1^, such that ^ and <p, taken this time 
as an unordered pair, are primitive Dirichlet characters modulo u and v satis- 
fying the parity condition (tptp)(— 1) = — 1, and t is a positive integer such that 
tuv | N. The parity condition shows that contains no triples ({ip, tp}, t) 
with the same character twice, so taking the characters in unordered pairs 
means that ^a^i contains half as many elements as it would otherwise, and 
formula (4.3) shows that again | j4 jv, i | = dim(£i(/\(iV))). Since the constant 
term and the Fourier coefficients of Ef ' ,v are symmetric in tp and tp, the series 
depends on the two characters only as an unordered pair, and it makes sense 
to define for each triple ({tp, tp}, t) € Ajv,i 

Ef'^'^T) = Ef'*(tr). 

Theorem 4.8.1. Let N be a positive integer. The set 

{Ef^ :({tP,tp},t)&A N>1 } 

represents a basis of £i(Fi(N)). For any character \ modulo N, the set 

{Ef"'* :({tP,tp},t)eA Nll , tptp = x } 

represents a basis of £\(N,\). 

This theorem combines with Theorem 4.5.2 to give a solution to the 
s squares problem (cf. Section 1.2) for s = 2,6,8 along with the solu- 
tion we already have for s = 4. For any even s > 10 the same methods 
give an asymptotic estimate f(n,s) of the solution r(n,s), meaning that 
lim n _ i . 00 f(n, s)/r(n, s) = 1. See Exercise 4.8.7 for the details. 

A weight 1 Eisenstein series Ef' 1 figures in the proof of the Modularity 
Theorem. This will be explained in Section 9.6. 
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Exercises 

4.8.1. This exercise touches briefly on properties of the infinite product <ja{z)- 
Since the general theory of infinite products is not assumed and will not be 
explained here (see, for example, [JS87]), the results are necessarily informal. 

(a) Using the principal branch of the complex logarithm, show that for 
0 < \z\ < min{|w| : oj £ A, uj ^ 0}, the sum 

]T'iog((i-zA,) e z /"+^/") 2 ) 

converges absolutely. It follows that the infinite product <ja(z) converges to a 
holomorphic function. 

(b) Show that if / = ]T^=i fn then /'// = Yln=i fJ in- The analogous 
result for infinite products holds for the functions a and Z. 

4.8.2. Prove the Legendre relation 772(^)^1 — ?7i(/l)u>2 = 2-7ri. (A hint for this 
exercise is at the end of the book.) 

4.8.3. The actual formula for r]2(A T ) in [Lan73] is 

\ m=l 1 / 

Prove that the right side is G^if). 

4.8.4. (a) Verify that the function is is well defined. 

(b) Verify that the function is degree-1 homogeneous with respect to -T(iV). 

4.8.5. Verify the calculations leading to formula (4.34). 

4.8.6. Show that g[~ Cv4v) (t) = -G[ Cv ' dv \r) when c v ^ 0, and show that 

4.8.7. (a) Recall the function 9(t, k) from Section 1.2. Similarly to that sec- 
tion's argument that 6>(r,4) e 7W 2 (-fo(4)), show that 6>(t,2) e M\{ri(A)) and 
consequently 6(t, s) S A^ s /2(A(4)) for all even s > 2. 

(b) Use results from Chapter 3 to show that 5 S / 2 (A(4)) = {0} for s = 
2, 6, 8. It follows that 6*(r, s) is a linear combination of Eisenstein series for 
these values of s. Use results from Chapter 3 to compute dim(A1 s /2(A(4))) 
in each case. (Hints for this exercise are at the end of the book.) 

(c) Recall from Section 1.2 the notation r(n, k) for the number of ways of 
representing the positive integer n as a sum of k squares. Use Theorem 4.8.1 
to show that 

r(n,2)=4 ^ (-l)^" 1 )/ 2 . 

0<m\n 
m odd 
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Similarly, use Theorem 4.5.2 to find formulas for r(n, 6) and r(n, 8). 

(d) The proof of Proposition 5.9.1 to follow will establish that the Fourier 
coefficients of any cusp form / € S k (r) satisfy \a n (f)\ < Cn k / 2 for some C . 
Let s > 10 be even. Argue that the Fourier coefficients of the Eisenstein series 
in AA s /2{r\{A)) dominate those of the cusp forms in <S S / 2 (A(4)) as n — > oo, 
and so the methods of this exercise give an asymptotic solution to the s squares 
problem by ignoring the cusp forms. 

4.9 The Fourier transform and the Mellin transform 

The next section will explain an idea originally due to Hecke to get around 
the conditional convergence of C?2(t) and the divergence of the analogous 
scries Gi(t) in a much farther-reaching way than the methods so far. This 
will require a bit more analysis. As a warmup, the ideas are first used in this 
section to prove the meromorphic continuation and functional equation of the 
Riemann zeta function as discussed in Section 4.4. 

For any positive integer I, the space of measurable and absolutely inte- 
grable functions on R' is 

£ 1 (R i ) = {measurable / : R' — > C : / |/(ar)|da: < oo}. 

Any / € £ 1 (R / ) has a Fourier transform f : R' — > C given by 

/>)= / f{y)e- 2 ™ M dy, 

where ( , ) is the usual Euclidean inner product. Although the Fourier trans- 
form is continuous (Exercise 4.9.1), it need not belong to £ 1 (R'). But if also 
IxeR 1 \f( x )\ 2 d x < 00 then J x£Ri \f(x)\ 2 dx < oo, see for example [Rud74]. 
The I- dimensional theta function #( , I) : % — > C is 

0( T ,l) = e™ |n|2r , ren 

nez 1 

where | | is the usual Euclidean absolute value. The sum converges very 
rapidly away from the real axis, making absolute and uniform convergence on 
compact subsets of % easy to show (Exercise 4.9.2) and thus defining a holo- 
morphic function. Specializing to r = it with t > 0 and letting / G £ X (R ( ) 
be the Gaussian function f(x) = e _7r l x l gives $(it,l) = X^nez 1 /( n ^ 2 )> a 
sum of quickly decreasing functions whose graphs narrow as \n\ grows. (See 
Figure 4.1.) 

We need the Fourier transform of the Gaussian f(x). First set I = 1, call 
the function <p in this case, and compute 
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0.2 0.4 0.6 0.8 1 
Figure 4.1. The first few terms of 1) and their sum 

/OO r-OO 
e -n(y 2 +2iyx~x 2 ) e -KX 2 dy ^ e -nx 2 / e^V+^dy. 
a = — oo ^/y— — oo 

Complex contour integration shows that the integral is just the Gaussian 
integral J^° oo e~ 7:v dy (Exercise 4.9.3(a)), and this is 1 (Exercise 4.9.3(b)). 

Thus / = / when I = 1. To prove the same result for I > 1, note that / 
takes the product form f(x) — ip(xi) ■ ■ ■ (fi(xi) for any vector x — (x\, . . . ,xi), 
and consequently the Fourier transform is f(x) = tp(xi) ■ ■ ■ <p(xi), i.e., / = / 
as desired (Exercise 4.9.3(c)). Also, for any function h G £ 1 (R') and any 
positive number r, the Fourier transform of the function h(xr) is r~ l h{xjr) 
(Exercise 4.9.3(d)), so in particular the Fourier transform of f{xt 1 / 2 ) is 
t-^fixt- 1 / 2 ). 

For any function h e £ 1 (R / ) such that the sum J2dez l H x + d) converges 
absolutely and uniformly on compact sets and is infinitely diffcrentiable as a 
function of x, the Poisson summation formula is 

h(x + d) = h{m)e 2ni{m ' x) . 
dez' mez 1 

For example, formula (1.2), which we have used to compute the Fourier 
expansions of Eisenstein series, is a special case of one-dimensional Pois- 
son summation known as the Lipschitz formula (Exercise 4.9.4). Letting 
h(x) be the Gaussian f{xt 1 / 2 ), Poisson summation with x = 0 shows that 
E/(ni 1 / 2 )=t-'/ 2 E/(^ 1/2 ) I i-e., 

d{i/t, I) = t l/2 d(it, I), t > 0. (4.36) 

By the Uniqueness Theorem from complex analysis this relation extends to 

0(-l/r, J) = Hr)'/ 2 tf(T, I), ren. 

Here — it lies in the right half plane, and the square root is defined for all 
complex numbers except the negative reals, extending the real positive square 
root function of positive real numbers. This transformation law should look 
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familiar from Section 1.2 since the function that arose there from the four 
squares problem is 6(t, I) = i9(2r, I). 

The properties of the Ricmann zeta function are established by examining 
the Mellin transform of (essentially) the theta function. In general, the Mellin 
transform of a function / : R + — > C is the integral 

,dt 



poo 

g(s) = / 

Jt=o 



f(t)t° t 



for s-values such that the integral converges absolutely. (See Exercise 4.9.5 
for the inverse Mellin transform.) For example, the Mellin transform of e _t 
is r(s). Setting I = 1 and writing #(r) rather than i?(r, 1) for the duration 
of this section, consider the Mellin transform of the function Y^=i e" 7 ™ * = 
l/2(t?(ii) - 1) for t > 0, 

POO °° Jj. 1 POO 1, 

ff(-) = / E e— 2t t S J = \ / (*(<*) l)f y • (4-37) 
Jt=o n=1 r L Jt=o 1 

Since i?(if) converges to 1 as t — >• oo, the transformation law (4.36) shows that 
as t — > 0, i9(zt) grows at the same rate as i -1 / 2 , and therefore the integral 
g(s) converges at its left endpoint if Re(s) > 1/2. And since the convergence 
of "&(it) to 1 as t — >• oo is rapid, the integral converges at its right end for all 
values of s. Rapid convergence lets the sum pass through the integral in (4.37) 
to yield, after a change of variable, 

00 f 00 dt 

gis)^^™ 2 ^ 3 e-H s T = n- s r( S )a2 S ), Re(s) > 1/2. 

n=l Jt =° t 

Thus the integral g(s/2) is the function ^(s) = Tt~ s l 2 r{s/2)C > {s) from Sec- 
tion 4.4 when Re(s) > 1. That is, the Mellin transform of the 1-dimensional 
theta function is the Riemann zeta function multiplied by a well understood 
factor. Thinking in these terms, the factor TT~ s / 2 r(s/2) is intrinsically asso- 
ciated to C(s), making £(s) the natural function to consider. 

The second integral in (4.37) provides the meromorphic continuation and 
functional equation of £. Computing part of g(s/2) by splitting off a term, 
replacing t by 1/t, using the transformation law (4.36) for d(it) with 1 = 1, 
and splitting off another term gives (Exercise 4.9.6) 
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And combining this with the remainder of the integral g(s/2), 

« fl ) = 5jf^( < *)- 1 )( Wa +* (1 "' )/a )f -^-1^' Re(-)>1- (4-38) 

But the integral in (4.38) is entire in s, making the right side holomorphic 
everywhere in the s-plane except for simple poles at s = 0 and s = 1. And 
the right side is clearly invariant under s n- 1 — s. Thus the function £(s) has 
a mcromorphic continuation to the full s-plane and satisfies the functional 
equation £(1 — s) = £(s). The properties of the Riemann zcta function as 
stated in Section 4.4 now follow. 

Exercises 

4.9.1. Show that the Fourier transform is continuous. (A hint for this exercise 
is at the end of the book.) 

4.9.2. This exercise shows that the series i?(r, I) — J2nez l e "' n ' 2r is holomor- 
phic. 

(a) Let S m = {n e Z l : max{|m|, . . . , \ni\} — m} for each natural num- 
ber m. Note that \n\ 2 > m 2 for all n e S m . Show that |S m | < l(2m + l) 1 ^ 1 . 

(b) Let K C % be compact. Thus there exists some y n > 0 such that 
Im(r) > yo for all t e K. Show that for any r e K and n e S m , 

| e iri|n| 2 T| ^ e -7rm 2 y 0 

(c) There exists a natural number M 0 such that if m > M 0 then I (2m + 
^y-i < e m an( j nm^y^ > 2m. Show that for any t E K and any M > M 0 , 
the corresponding tail of the series i?(r, Z) satisfies 

oo 

J- | e H«| 2 r|< ^ e _ ro = e _ M(1 _ e -l ) -l_ 

n£Z' m=M 
\n\ 2 >M 

This is less than arbitrary e > 0 for large enough M independently of r, 
showing that the series i?(r, I) converges absolutely and uniformly on compact 
subsets of H. Since each summand is holomorphic, so is the sum. 

4.9.3. (a) Use complex contour integration to show that f£l_ 00 e^^^ 1 ^ dt 

equals e~ 7rt2 dt. 

(b) Show that this Gaussian integral is 1. (A hint for this exercise is at the 
end of the book.) 

(c) Show that if / G £ x (i?') takes the form of a product f(x) = 
fi{ x i) " " " fl{ x l) then its Fourier transform is the corresponding product 
f(x) = f 1 (x 1 )---f l (x l ). 

(d) Show that for any function h e £ 1 (R i ) and any positive number r, the 
Fourier transform of the function h(xr) is r~ l h(x/r). 
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4.9.4. Formula (1.2) is a special case of the Lipschitz formula for r e % and 
q = e 27TiT , 

"52 -, — —jr- = ^—^T\- m s ~ 1 q m 1 Re( S ) > 1. 

Prove the Lipschitz formula by applying Poisson summation to the function 
/ e C\H) given by 



0 if x < 0. 



(A hint for this exercise is at the end of the book.) 

4.9.5. If g is a holomorphic function of the complex variable s in some right 
half plane then its inverse Mellin transform is 



i rcr+i 



g(s)t s ds 



for positive t-values such that the integral converges absolutely. Complex con- 
tour integration shows that this is independent of a. 

(a) Let t = e x , s = a + 2niy, f(t) = e^ ax f a (x), and g(s) = g a {y). Show 
that 

9{s)= / f(t)t s - 4=^ g a {y)= / U(x)e 2 ™«dx 

Jt=0 1 Jx=-oo 

and that 

/(*) = Y~. / g( S )r s ds ^ U(x) = / g a (y)e-^y*dy. 

The right side conditions are equivalent by Fourier inversion, and so the left 
side conditions are equivalent as well. This is the Mellin inversion formula. 

(b) Evaluate the integral f^ 1 ™ r(s)x~ s ds for any a > 0. (A hint for 
this exercise is at the end of the book.) 

4.9.6. Verify the calculations leading up to formula (4.38). 



4.10 Nonholomorphic Eisenstein series 

Hccke put the Eisenstein series of low weights 1 and 2 into a larger context 
by augmenting Eisenstein series with a complex parameter s. We will study a 
variant of Hecke's original scries. Let N be a positive integer, let v be a vector 
in (Z/NZ) 2 of order N, let k be any integer, and let e N be 1/2 if TV e {1, 2} 
and 1 if TV > 2. Define 
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EUr,s) = e N ]T' {cT + d) l\ CT + d?s , r = X + l yeU. 



(c,d)=v (N) 
gcd(c,d) — 1 

This converges absolutely on the right half plane {s : Rc(fc + 2s) > 2}. 
The convergence is uniform on compact subsets, making £^(t, s) an ana- 
lytic function of s on this half plane. To rewrite the series in intrinsic form, 
let S = [ c " ] G SL 2 (Z) with (c v , d v ) a lift of v to Z 2 , and recall the positive 
part of the parabolic subgroup of SL 2 (Z), P + = {[q "] : n G Z}. Extend the 
wcight-fc operator to functions of r and s by the definition 

(fh]k)(T,s)=j( 1 ,T)- k f( 1 (T), S ), 7 eSL 2 (Z). 

Then 

El(r,s) = e N Yl Mr) s h]k- 

-te(P + nr(N))\r(N)5 

This formula makes it clear that (-Efc[7]fc)(Y, s) — ^ 7 (r, s) for all 7 e SL 2 (Z) 
as in Proposition 4.2.1, so that in particular 

Elb\k = El forall 7 er(7V). 

As earlier, the corresponding nonnormalized series 



{c,d)=v(N) 



G v k {r,s)= Y' -. -J — (4.39) 

^ (ct + d) k \cr + d\ 2s 



will be easier to analyze. Relations like (4.5) and (4.6) expressing each G\ as 
a linear combination of the E% and vice versa still hold (Exercise 4.10.1(a)), 
so Gy. transforms under SL 2 (Z) in the same way as E^. (Exercise 4.10.1(b)). 
The discussion leading to relations (4.20) and (4.21) generalizes to show that 
meromorphically continuing cither sort of Eisenstein series meromorphically 
continues the other (Exercise 4.10.1(c)). This section will show that G v k as 
a function of s has a meromorphic continuation to the entire s-plane. For 
s-values to the left of the original half plane of convergence Re(fc + 2s) > 2, 
the continuation is no longer defined by the sum, but the transformation law 
continues to hold by the Uniqueness Theorem from complex analysis. The 
natural Eisenstein series to consider is 

Gl(r,0) = Gt(r,s)\ s=0 . 

For k > 3 the parameter s simply appears and disappears, but for k < 2 it 
gives something new. Hecke's method for weights 1 and 2, obtaining the func- 
tions 3i (t) and 5 2 (t) from before, is presented in [Sch74], and a generalization 
of Hecke's method due to Shimura is presented in [Miy89] and [Hid93]. The 
method here uses a Mellin transform and Poisson summation, extending the 
ideas of the previous section. See for example Rankin [Ran39] , Sclbcrg [Scl40] , 
and Godement [God66]. 
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Working in dimension 1 = 2, view elements of R 2 as row vectors and define 
a modified theta function whose argument is now a matrix, 

0( 7 )= e-^\ 7 G GL 2 (R). 
nez 2 

Again the second variable from the general notation for z9 is omitted since I 
is fixed at 2. When 7 = It 1 / 2 where / is the identity matrix and t > 0, 
this is $(i£) from the previous section. To see how a more general matrix 
7 affects the transformation law for i9, compute that for any function / G 
£ 1 (R 2 ), any matrix 7 € SL 2 (R), and any positive number r, the Fourier 
transform of the function ip(x) = f(xjr) is ip(x) = r~ 2 f(xj~ T /r) where 
7~ T denotes the inverse-transpose of 7 (Exercise 4.10.2). In particular, again 
letting f(x) = e~^\ x \ be the Gaussian, Poisson summation shows that for 
7 G SL 2 (R), r£/(n 7 r) = r" 1 £ /(n^/r). Let S = [ ° - J] . Then S 7 " T = 
7 5 for all 7 G SL 2 (R) and \xS\ = \x\ for all x G R 2 , so f(nSj- T /r) = 
f(nj/r). As n varies through Z 2 so does nS, and thus the Poisson summation 
is r S/( n 7 r ) — r_1 S/( n 7/ r )- That is, 

n?(7r) = (l/r)i?(7/r), 7 G SL 2 (R), r > 0. (4.40) 

For any 7 G SL 2 (R), the Mcllin transform of Yln^z? e~^\ H = d^t 1 ' 2 )- 
1 for t > 0 is 

ff(*.7)= r ^'e-^Vf = f(tf(y/ 2 )-l)«4 ( 4 - 41 ) 

Again ?9(7i 1 / 2 ) converges to 1 as i — > 00, so the transformation law (4.40) 
shows that as t — > 0, i^i 1 / 2 ) grows as 1 /t, and therefore the integral converges 
at its left endpoint if Re(s) > 1. Also as before, the integral converges at its 
right end for all values of s, and rapid convergence of the sum lets it pass 
through the integral to yield after a change of variable, 



5 ( S ,7) = £ Mn 7 | 2 r s / e-H s - = n-°r( S ) £ |n 7 |- 2 " 



for Re(s) > 1. To connect this with Eisenstein series, for any point r = x+iy G 
H let 7 T be the corresponding matrix 



1 

7^ = ~F 



y x 
0 1 



G SL 2 (R). 



(This matrix, which transforms i to r, appeared in Chapter 2, in Exer- 
cise 2.1.3(c) and again in the proof of Corollary 2.3.4.) For n = (c,d) G Z 2 
compute that |n7 T | 2 = \(cy, cx + d)/yjy\ 2 = \cr + d\ 2 /y. Therefore the Mcllin 
transform is essentially the Eisenstein series, 
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g(s, 7t ) = n-T(a)^2' f = ir- s r(s)G 0 (T, a), Re(«) > 1, 

(c,d) 11 

where Go(r, s) is (4.39) with weight k = 0 and level N = 1. (In this case there 
is only one v G (Z/NZ) 2 , so it is suppressed from the notation Gq.) 

The second integral in (4.41) provides the meromorphic continuation and 
functional equation of Go- As in the previous section, the transformation law 
for i? shows that part of the integral is 



Jt=o r Jt=o r s 



0 (7t -l/2 )t -.* _ I 
i - I t s 



t=l f s 

Combining this with the remainder of the integral gives 

/■oo ,7+ 1 1 

g(s, 7 )= WTt^-W + ?-')- , Re(s)>l. 

Jt=i r s i s 

This integral is entire in s, making the right side holomorphic everywhere in 
the s-plane except for simple poles at s — 0 and s = 1. And the right side is 
invariant under s h- > 1— s. Specializing to 7 — j T , the function tt~ s _T(s)Go(t, s) 
for Re(s) > 1 has a meromorphic continuation to the full s-plane that is 
invariant under s i-> 1 — s. 

Suitably modified, this argument extends to higher weights and levels. Let 
N > 1 and let G be the group (Z/NZ) 2 . Rather than taking one vector v G G, 
consider any function 

a : G — > C. 

Of course, the function a could simply pick off a vector, but thinking about the 
entire group symmetrizes the result nicely. Let /in = e 2 ^ l l N , let S — [J 
and let ( ,) be the usual inner product on R 2 . The Fourier transform of a is 
a function a : G — > C, 



The resulting Fourier series reproduces the original function (Exercise 4.10.3) 
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and a = a (Exercise 4.10.3 again). 

Let k be a positive integer and let h k be the harmonic polynomial 

h k (c, d) = (-i) k (c + id) k , (c, d) e R 2 . 

Associate a theta function to each vector v G (Z/NZ) 2 , 

€(7) - E h k {{v/N + n) 1 )e-^ v ' N+n ^ 2 , 7 e GL 2 (R). (4.42) 

neZ 2 

Again letting f(x) — e^ 71 "^ , define the Schwartz function f k (x) to be a 
modified Gaussian, 

fk(x) = h k (x)f(x). 

Thus i?|( 7 ) = £„ eZ2 fk((v/N + n) 7 ). For 7 e SL 2 (R) and r > 0, compute 
that ri?fc(7r) = rJ2fk((v/N + n)~/r) = r^tp k {v/N + n) where (/Jfc(ar) = 
f k (x^r). The Schwartz function has Fourier transform j fe = {—i) k f k (Exer- 
cise 4.10.4(a)), and so (p k (x) = (~i) k r~ 2 f k (x'-f~ T r~ 1 ). Poisson summation 
over nS and the relations S"f~ T = jS and S T = — S give 

r$Uir) = {-ifr- 1 E MnS^r-^e 2 ^ 3 ^ 
nez 2 

= (-i) k r~ 1 E fkinySr-^e^-^l 
nez 2 

Letting z(x) = c + id e C for any x = (c,d) G R 2 makes h k (x) = 
{-i) k {z(x)) k = (z(xS)) k and thus h k (xS) = (-i) k h k (x) (Exercise 4.10.4(b)). 
Consequently f k (xS) = (—i) k f k (x) as well, and now 

r^( 7 r) = (-l) fe r- 1 E Mn^- 1 )^^ 
nez 2 

= (-i)*r- 1 E E a(^-W^ s> 

">eG „ e z 2 

ri=tu (AT) 

= (-l)V- 1 E E / fe ((W^ + n)7^- 1 )^ K, ' S> 
= (-l) fe r- 1 EC(7^r- 1 ) M ^'" S> . 

KG 

Thinking here of d v as a function of v, the sum is iV times this function's 
Fourier transform, giving rd v k {"{r) = {—l) k Nr , d k {^Nr~ 1 ). Since r > 0 is 
arbitrary, replace it by N 1 / 2 r to get 

r$t{ 1 N 1 /' 1 r) = {-l) k r- 1 dl(')N 1 l 2 r- 1 ), 7 e SL 2 (R), r > 0. (4.43) 

Thus (Exercise 4.10.5) 
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Proposition 4.10.1. For any function a : G — > C, the associated sum of 
theta functions 

mi) = £(«(«) + (-l) fc «(-^M(7^V 1/2 ), 7 G GL 2 (R) 

vEG 

satisfies the transformation law 

rO a k (jr) = r-'eKjr- 1 ), 7 G SL 2 (R), r > 0. 

The functional equation for the associated sum of Eisenstein series fol- 
lows as before. The constant terms of the series $£(7) are 0, so that O^i^r) 
converges to 0 rapidly as r — >■ 00. Consider the Mellin transform 

f°° rlt 

9t{s,l)= et^y-. (4.44) 
Jt=o 1 

The transformation law shows that the integral converges at t = 0 for all 
values of s, and as before it converges at its right end for all s as well. Passing 
the sums through the integral, noting that hk(xr) — hk(x)r k for r e R, and 
changing variable gives 

g a k { S n) = *- k,2 - s r{k/2 + s )N s 

+ (-l) k a(-v)) ]T' /i fe (n7)|n 7 |- fe - 2s . 

vEG n=v(N) 

Setting 7 = 7 T gives h^wy) — {z(wyS)) k — (cT + d) k /y k / 2 and gives 

| n7 |-*-2- = yk/2+s J\ CT + d] k+2s ^ with product y s/((cT + d) k \cT + d\ 2 ( s - k W). 

Thus for Re(s) > 1, 

7 r) - n- k ' 2 - s r{k/2 + S )7V s y fe / 2 G^(r, 0 - k/2) (4.45) 
where G£ is the associated sum of Eisenstein series (4.39), 

G%(t, s) = J2(a(v) + (-l) k a(-v))Gt(r, s), Re(fc/2 + s) > 1. (4.46) 

Proposition 4.10.1 shows that part of the integral (4.44) is 

Jt=0 r Jt=l r Jt=l r 

Combining this with the remainder of the integral gives 

/■°° <7f 

The integral is entire in s and invariant under s4 1 — s. Similar arguments 
for k = 0 and fc < 0 (Exercise 4.10.6) show that in all cases, 
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Theorem 4.10.2. Let N be a positive integer and let G = (Z/NZ) 2 . For any 

function a : G — > C, let G^(r, s) be the associated sum (4.46) of Eisenstein 
series. Then for any integer k and any point r = x + iy G %, the function 

(ir/N)- s r(\k\/2 + s)G%(t, s - k/2), Re(«) > 1 

has a continuation to the full s-plane that is invariant under s n- 1 — s. The 
continuation is analytic for k ^ 0 and has simple poles at s = 0,1 for k = 0. 

The function in the theorem comes from canceling the s-independent terms 
in the Mellin transform (4.45). Exercise 4.10.8(b-c) shows that sums of only 
two Eisenstein series satisfy functional equations for the larger group Ti(N). 

The meromorphically continued Eisenstein scries of this section have sev- 
eral applications. As already mentioned, continuing the series for weights k = 1 
and k — 2 to s = 0 recovers the Eisenstein series from earlier in the chapter. 
Also, the Rankin-Selberg method integrates an Eisenstein scries against the 
square of the absolute value of a cusp form, obtaining an L-function satisfying 
a functional equation. This idea was originally used to estimate the Fourier 
coefficients of the cusp form, but i-functions of the sort obtained by this 
method are now understood to be important in their own right. For more on 
this topic, see [Bum97]. 

A more subtle use of the meromorphically continued Eisenstein series is 
in the study of squarc-intcgrable functions called automorphic forms on the 
quotient space SL 2 (Z)\SL 2 (R), the arithmetic quotient. The space of such 
functions decomposes as a discrete part and a continuous part, the former 
like Fourier series and the latter like Fourier transforms. The discrete part 
contains cusp forms and a little more, the residual spectrum, so called because 
it consists of residues of Eisenstein series in the half plane to the right of the 
line of symmetry for the functional equation. The continuous part consists 
of integrals of Eisenstein series on the critical line against square-intcgrable 
functions on that line. Thus the explicit spectral decomposition cannot be 
stated at all without knowing the meromorphic continuation of Eisenstein se- 
ries. These ideas extend in interesting ways to matrix groups beyond SL 2 (R), 
where their original proofs often proceeded by establishing the meromorphic 
continuation en route. 

Exercises 

4.10.1. (a) Find relations like (4.5) and (4.6) expressing each G% as a linear 
combination of the E% and vice versa. 

(b) Verify that G v k transforms under SL 2 (Z) the same way as E%. 

(c) Show that meromorphic continuation of cither sort of Eisenstein series 
gives meromorphic continuation of the other. 

4.10.2. Let f(x) be a function in ^(R 2 ). For any 7 e SL 2 (R), and r > 
0, show that the function ip(x) = /(aryr) has Fourier transform <p{x) = 
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r~ 2 / (x-f~ T I V) where r y~ T denotes the inverse-transpose of 7. (A hint for this 
exercise is at the end of the book.) 



4.10.3. Show that a(u) = E a(v)fi ( ^' vS ' > for u g G and that a = a. (A 
hint for this exercise is at the end of the book.) 

4.10.4. (a) Let / g £ 1 (R'), let j e {!,..., I}, and define 



g : R l — > C, g(x) = Xjf(x) 



where Xj is the jth component of x. Suppose that g € £ 1 (R Z ). Show that 

Applying this identity repeatedly shows that the Schwartz function fk(x, y) = 
{—i) k {x + iy) k e~*( x +v ' has Fourier transform 

1 \ k ( d . d N ''' 



Switching to complex notation fh{z) = {—i) k z k e vzz , this is 



1 \ k ( d x 



dz 

Compute that this relation is fk = {—i) k fk as desired. 

(b) Show that h k (x) = (z(xS)) k and h k (xS) = (-i) k h k (x). 

4.10.5. (a) Show that for all 7 g GL 2 (R), 

E o(«)<(7) - E *(-«)^(7) ^d E S(-«)^(7) - E 

(A hint for this exercise is at the end of the book.) 
(b) Prove Proposition 4.10.1. 

4.10.6. (a) Define h 0 (x) = 1 for all x g R 2 . For TV > 1 and k = 0 define thcta 
series {i9q : v g G} by the formula in the section. Prove Theorem 4.10.2 when 
k = 0. 

(b) When k < 0 define ftfc(a;) = h-k(x) for x g R 2 , where the overbar 
denotes complex conjugation. Prove Theorem 4.10.2 when k < 0. (A hint for 
this exercise is at the end of the book.) 

4.10.7. What does Theorem 4.10.2 say when the function a picks off one 
vector, i.e., a — a w where a ¥ (w) is 1 if w = v and 0 otherwise? 
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4.10.8. (a) Let ip and ip be primitive Dirichlet characters modulo u and v 
with ftM(-l) = (-l) fe and uv = N. If a : (Z/NZ) 2 — > C is the function 

a(cv, d + ev) = tp(c)(p(d), a(x, y) = 0 otherwise, 

show that its Fourier transform is 

o(— cu, — (d + eu)) = (g((p) /v)g(ip)ip(c)'ip(d), a(x,y) = 0 otherwise. 

(Hints for this exercise are at the end of the book.) 

(b) Define an Eisenstein series with parameter, 

g^(t, s) = £ g E mm<%***°\T, s). 

c=0 d=0 e=0 

Show that the sum (4.46) of Eisenstein series for the function a in this problem 
is 

G£(t, a) = Oi»(T, s) + (-l) k {g^)/v)g(4>)Gt^(T, s). 

Thus the functional equations for the eigenspaces Ek(N,x) involve only two 
series at a time. 

(c) Define a function b : (Z/NZ) 2 — > C and a series Ef' v (T,s) by the 
conditions 

v v 
Show that the sum (4.46) of Eisenstein series for b is more nicely symmetrized 
than the one for a, 

4.11 Modular forms via theta functions 

This chapter ends by using theta functions to construct a modular form that 
both connects back to the preface and adumbrates the ideas at the end of the 
book. The construction is one case of a general method due to Hccke [Hec26]. 

Recall that the preface used Quadratic Reciprocity to motivate the Mod- 
ularity Theorem via a simple analog, counting the solutions modulo p to the 
quadratic equation x 2 = d. Now consider a cubic equation instead, 

C : x 3 = d, d e Z + , d cubefree, 

and for each prime p let 

a p (C) = (the number of solutions modulo p of equation C) — 1. 

Results from elementary number theory show that (Exercise 4.11.1) 
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a P (C) 



2 if p = 1 (mod 3) and d is a nonzero cube modulo p, 
-1 if p = 1 (mod 3) and d is not a cube modulo p, (4.47) 
0 if p = 2 (mod 3) or p | 3d. 



This section will use Poisson summation and the Cubic Reciprocity Theorem 
from number theory to construct a modular form 0 X with Fourier coefficients 
a p(®x) ~ a p(C)- Section 5.9 will show that these Fourier coefficients are eigen- 
values. That is, the solution-counts of the cubic equation C are a system of 
eigenvalues arising from a modular form. Chapter 9 will further place this 
example in the context of Modularity. 
Introduce the notation 

e(z) = e 2mz , zeC. 

Let A = Z[/x 3 ], let a = is/2>, and let B = ±A. Thus A C B C \A. Note that 

\x\ 2 = x\ — x\X2 + x\ for any x = x\ + x 2 ^3 € R[^3]. 

We will frequently use the formula |x + y| 2 = |a;| 2 + tr (xy*) + \y\ 2 for x,y E C, 
where y* is the complex conjugate of y and tr (z) = z + z* . For any positive 
integer N and any u in the quotient group ^A/NA define a theta function, 

0"(r,JV) - J2 e ( N \ u / N + n \ 2T ) . T<En - ( 4 - 48 ) 

neA 

An argument similar to Exercise 4.9.2 shows that 9 U is holomorphic. The 
following lemma establishes its basic transformation properties. From now 
until near the end of the section the symbol d is unrelated to the d of the 
cubic equation C. 

Lemma 4.11.1. Let N be a positive integer. Then 
& a {r + l,N)=B^pj0 a {T,N), u e B/NA, 

6>"(r, N)= fl "( dr > dN )> " e B/NA, d e Z+ 

vGB/dNA 
v=u (NA) 

e^(-l/r,N) = ^= ]T e (-^)r(r,7V), tJ e B/NA. 

weB/NA 

Proof. For the first statement compute that for m e and n e A, 
iV|^+n| 2 = M! ( mod Z ). 

For the second statement note that 0"(r, AT) = e ( d7V l | 2rfr ) ■ Lct 

n = r + dm, making the fraction u ^ r + m. Thus 
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0"(t,AO= e ( dN \ M iw Z + m \ 2dT ) = <T +Arr (dT, dN), 

reA/dA reA/dA 
rneA 

where the reduction u + Nr is taken modulo dN. This gives the result. 

The third statement is shown by Poisson summation. Recall that the defin- 
ing equation (4.42) from the previous section was extended to k = 0 in Exer- 
cise 4.10.6(a), 

0g( 7 ) = e-^ v / N+n ^\ v G (Z/NZ) 2 , 7 G GL 2 (R). 
nez 2 

To apply this let 7 G SL 2 (R) be the positive square root of ^ [_ x ~ 2 ], 
satisfying 

\(x 1 ,x 2 h\ 2 = 7gl x i + X 2M3| 2 , £i, ^2 G R 

Note that B/NA C g^/JV4 = A/3NA. Identify ^ with Z 2 so that if v G S 
then its multiple 3t> € C A can also be viewed as an element of Z 2 . 
Compute with 3N in place of N and with 7 as above that for any t G R + , 

df^N) 1 ' 2 ^/^,) 1 ' 2 ) = J2 e -^ Ar i(w +n hi 2t 

nez 2 

neA 

This shows that the identity (4.43) with k = 0, with 3iV in place of iV, with 
7 as above, and with r = (VSt)^ 1 / 2 is (Exercise 4.11.2) 

e*(-l/(it),N) = -±- e(- 3 -^-)e^3it,N), v G B/NA, 

N ^KelA/NA V 7 

where uu* = (wu*)i + {vu*) 2 ^. Generalize from t G R + to — it for r G "H by 
the Uniqueness Theorem of complex analysis to get 

0*(-l/r,N) = j£= Y, e(- (m y' )a )^a). v G B/NA. 

The sum on the right side is 

Y e(-<^) £ r(3r,7V). 

tueB/WA «£§ A/ATA 

au=w (NA) 

To simplify the inner sum note that 6 w (t, N) = J2neA e ( N \^r I1 \ 2 ^ T ) for 
any w G B/NA, and similarly to the proof of the second statement this works 
out to 
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& W ( T ,N)= ]T eU 

reA/aA ^ 
mGA 

That is, the inner sum is 



{w+Nr)/a 
N 



' 3t )= E e^ w+Nr ^ a {3T,N). 

reA/aA 



E 0 u {3t,N) =6 w {t,N), TO e B/NA. 

ue^A/NA 
au=w (NA) 

The proof is completed by noting that (avw*) 2 = tr (vw*). □ 
The next result shows how the theta function transforms under the group 



r 0 (3N,N) 



a b 
c d 



G SL 2 (Z) : b = 0 (mod N), c = 0 (mod 37V) ^ . 



Proposition 4.11.2. Let N be a positive integer. Then 

(e^} 1 )(T,N)=(l)6 m (r,N), u e A/NA, 7 = [ «*] e r 0 (37V, TV), 
/fere (rf/3) is £/ie Legendre symbol. 

= 9 au we can assume d > 0 by replacing 7 with —7 if 



Proof. Since 0 ' 
necessary. Write 



AT + b 1 



cr + d d \d/r + c 
Apply the second statement of the lemma and then the first statement to get 

H7(r),A0= E ^M-^div). 

tJ=M (JVA) 

The third statement and again the first now give 

V v,w£B/dNA 
v=u (NA) 



i(cT + d) 

v v,w£B/dNA 
v=u (NA) 



E 



2 — tr (uiu*) — c|w| 



dN 



■) 9 w (-d/r, dN). 



Note that cw e 7VA for w e B since c = 0 (mod 3iV). It follows that 

EAH 2 -t r( ^)- C |H 2 A \ ^ / — cw| 2 — tr — cw)w*) — c\w\ 2 \ 

G \ dN )— \ dN ) 



v£B/dNA 
v=u (NA) 



v£B/dNA 
v=u (NA) 



-(■ 



tr (auw*) 



E e(^) 



v£B/dNA 
v=u (NA) 
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where the last equality uses the relation ad—bc= 1 . Since b = 0 (mod N) the 
summand e (b\v\ 2 / (dN)^ depends only on v (mod dA), and since (d,N) = 1 
and u e A/NA the sum is J2veA/dA e (b\ v \ 2 / (dN)) . This takes the value 
(d/3)d (Exercise 4.11.3). Substitute this into the transformation formula and 
apply the second and third statements of the lemma to continue, 



T (l(r),N) = (") E e(-^i)^(d(-l/r),dA0 

V \ / weB / dNA 

_i(CT + d 1 fd\ E e (_^) r( _ 1/TjAr) 



vEB/NA 

ct + d ( d 



37V 2 V3 



]T e (- tr( ™*+'""'* ) ) ^(r,7V). 



v,wEB/NA 

Exercise 4.11.3(b) shows that the inner sum is 

/ tr (v(w* + au*))\ _ j 3N 2 ]£w = -au, 

^NA B V * ) 1° if * * -BU, 

completing the proof since Q~ au = 9 au . □ 

To construct a modular form from the theta functions we need to conjugate 
and then symmetrize. To conjugate, let 5 = °] so that 

sroiw 2 ^- 1 = r 0 (3N,N) 

and the conjugation preserves matrix entries on the diagonal. Recall that the 
wcight-fc operator was extended to GL^(Q) in Exercise 1.2.11. Thus for any 

7 = [ a c b d ] er 0 (3N 2 ), 

(n5 7 ]i)(r) = (0* [ 7 '<5]i)(t) where 7 ' = fry J" 1 e r 0 (3JV, TV) ^ 

= (d/3) (OS] 1 ) (r) since d = dy . ' " " 

The construction is completed by symmetrizing: 

Theorem 4.11.3. Le£ N be a positive integer and let \ ■ (A/NA)* — > C* be 
a character, extended multiplicatively to A. Define 

0x(r)=l £ xW(Nt,N). 

ueA/NA 

Then 

<y 7 h = X (d)(d/3)e x , 7 = [s 5] e r 0 (3^v 2 ). 

Therefore 

0 x eA4i(3JV 2 ,V), VW = x(rf)W3). 
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The desired transformation of 9 X under r 0 (3N 2 ) follows from (4.49) since 
0 X = J2u x{ u )0 u [5]\ (Exercise 4.11.4). To finish proving the theorem note that 

oo 

W = g E X(«)e (\n\ 2 r) = £ a m (8 x )e 2 ™ 

neA m=0 

where 

a m (0 x ) = l £ X(n). (4.50) 

neA 
\n\ 2 —m 

This shows that a n (8) — O(n), i.e., the Fourier coefficients are small enough 
to satisfy the condition in Proposition 1.2.4. 
For example, when N = 1 the theta function 

neA 

is a constant multiple of Ef' 1 , the Eisenstein series mentioned at the end 
of Section 4.8 (Exercise 4.11.5). This fact is equivalent to a representation 
number formula like those in Exercise 4.8.7. 

Along with Poisson summation, the other ingredient for constructing a 
modular form to match the cubic equation C from the beginning of the 
section is the Cubic Reciprocity Theorem. The unit group of A is A* = 
{±1, ±/i 3 , ±/i|}. Note that formula (4.50) shows that 6 X — 0 unless \ is trivial 
on A* . Let p be a rational prime, p = 1 (mod 3). Then there exists an clement 
it = a + 6^3 e A such that 

{neA: \n\ 2 = p} = A*tt U A*W. 

The choice of n can be normalized, e.g., to n = a + where a = 2 (mod 3) 
and 6=0 (mod 3) . On the other hand a rational prime p = 2 (mod 3) does 
not take the form p = \n\ 2 for any n <G A, as is seen by checking \n\ 2 modulo 3. 
(See 9.1-9.6 of [IR92] for more on the arithmetic of A.) A weak form of Cubic 
Reciprocity is: Let deZ + be cubefree and let N = 3 J7 p |dP- Then there exists 
a character 

X :(A/NA)* ^{l,» 3 ,t4} 

such that the multiplicative extension of \ to all of A is trivial on A* and 
on primes p \ N, while on elements n of A such that nn is a prime p \ N it 
is trivial if and only if d is a cube modulo p. See Exercise 4.11.6 for simple 
examples. 

For this character, 9 x (t,N) e Mi(3N 2 ,-0) where ip is the quadratic char- 
acter with conductor 3. Formula (4.50) shows that the Fourier coefficients of 
prime index are 
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a p {6 x ) 



2 if p = 1 (mod 3) and d is a nonzero cube modulo p, 
-1 if p = 1 (mod 3) and is not a cube modulo p, (4-51) 
0 if p = 2 (mod 3) or p | 3d. 



That is, the Fourier coefficients are the solution-counts (4.47) of equation C 
as anticipated at the beginning of the section. 



Exercises 

4.11.1. Show that for any prime p the map x n- x 3 is an endomorphism of the 
multiplicative group (Z/pZ)* . Show that the map is 3-to-l if p = 1 (mod 3) 
and is 1-to-l if p = 2 (mod 3). Use this to establish (4.47). 

4.11.2. Confirm that under the identification (xi,x 2 ) «-> x\ + x 2 ^3, the 
exponent —(u,vS) from Section 4.10 becomes — {vu*) 2 - Use this to verify 
the application of Poisson summation with 3u, 3v, and 37V in the proof of 
Lemma 4.11.1. 

4.11.3. For b,d e Z with (36, rf) = 1 let ip b>d = J2veA/dA e ( b \ v \ 2 / d ) ■ This 
exercise proves the formula (fb, d = (d/3)d. 

(a) Prove the formula when d = p where p 7^ 3 is prime. For p = 2 compute 
directly. For p > 3 use the isomorphism Z[\/^3]/pZ[\/^3] — > A/pA to show 
that 

<Pb,d = E e ( 6 ( r i + 3r 2)/p) • 

ri,r 2 GZ/pZ 

Show that if m is not divisible by p then 

E e(mr 2 /p)= E C 1 + (T) ) e ^ = (? ) <?(x) 

reZ/pZ sGZ/pZ 

where g(x) is the Gauss sum associated to the character x(s) = (s/p). Show 
that ,g(x) 2 = x(-l)|ff(x)| 2 = (— l/p)p similarly to (4.12). Use Quadratic 
Reciprocity to complete the proof. 

(b) Before continuing show that for any x G N~ 1 B, 

if x e B, 
if x $ B, 

if .x e A, 
if .x ^ A. 

(c) Prove the formula for d = p l inductively by showing that for t > 2, 



E e(tr (*«;*)) = 

and 

E e(tr(^)) = (^ 2 
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Vb,p*= Yl 2 e(% + p t_1 t«| 2 /l>*) 

veA/p t - 1 AweA/pA 

= e(%| 2 /V) Y e(ti(bvw*)/p)^(f b ^ t -2p 2 . 

veA/p*- 1 A weA/pA 

(d) For arbitrary d > 0 suppose that d = d\d 2 with (di,d 2 ) — 1 and 
di, c?2 > 0. Use the bijection 

A/d 1 A x A/d 2 A — > A/dA, (uT,U2) ^ ^2«i + diu 2 

to show that </?6 ; d = fbd 2 .d 1 ( Pbd 1 ,d 2 - Deduce that this holds for d < 0 as well. 
Complete the proof of the formula. 

4.11.4. Verify the transformation law in Theorem 4.11.3. 

4.11.5. (a) Use results from Chapter 3 to show that dim(<Si(_r 0 (3)) = 0, so 
that if V> is the quadratic character modulo 3 then Mi(3,i[>) = CEf' 1 . 

(b) Let 9i(t) denote the theta function in Theorem 4.11.3 specialized 
to N = 1, making the character trivial. Thus #i <G Aii(3,ip) and so #i is a 
constant multiple of Ef' 1 . What is the constant? 

(c) The Fourier coefficients a m (#i) are (up to a constant multiple) repre- 
sentation numbers for the quadratic form n\ — nin 2 + n 2 . Thus the represen- 
tation number is a constant multiple of the arithmetic function cr ( ^' 1 (m) = 
^2d\n V'(d) for m > 1. Check the relation between r(p) and ctq ' X {p) for prime p 
by using the information about this ring given in the proof of Corollary 3.7.2. 
Indeed, the reader with background in number theory can work this exer- 
cise backwards by deriving the representation numbers and thus the identity 
6*i = cEf'' 1 arithmetically. 

4.11.6. (a) Describe the character \ provided by Cubic Reciprocity for d = 1. 

(b) To describe \ f° r d = 2, first determine the conditions modulo 2 on 
a, b € Z that make a + &/13 £ A invertiblc modulo 2 A, and similarly for 3. 
Use these to show that the multiplicative group G = (A/6A)* has order 18. 
Show that A* reduces to a cyclic subgroup H of order 6 in G and that the 
quotient G/H is generated by g = 1 + 3/X3. Explain why \ 1S defined on the 
quotient and why up to complex conjugation it is 

x(H) - 1, x (gH) = Cs, x(g 2 H) - & 

(Here the symbol £3 is being used to distinguish the cube root of unity in the 
codomain C* of \ from the cube root of unity in A.) 

(c) Describe the character \ provided by Cubic Reciprocity for d = 3. 
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Hecke Operators 



This chapter addresses the question of finding a canonical basis for the space 
of cusp forms <Sfc(Ti(iV)). Since cusp forms are not easy to write explicitly like 
Eisenstein series, specifying a basis requires more sophisticated methods than 
the direct calculations of the preceding chapter. 

For any congruence subgroups Ji and _T 2 of SL 2 (Z), a family of double 
coset operators takes M.k{^i) to Aiki^), taking cusp forms to cusp forms. 
These operators are linear. Specializing to J\ = r 2 = A(iV), particular double 
coset operators (n) and T n for all n € Z+ are the Hecke operators, commuting 
endomorphisms of the vector space Mk(ri(N)) and the subspace Sk(r 1 (N)). 
The Petersson inner product makes Sk(ri(N)) an inner product space, and 
the Hecke operators (n) and T n for n relatively prime to the level TV are normal 
with respect to this inner product. Thus by linear algebra the space Sk(ri(N)) 
has an orthogonal basis whose elements are simultaneous cigcnfunctions for 
the Hecke operators away from N. Further decomposing Sfc(A(A0) into old 
and new subspaces partially eliminates the restriction: the new subspace has 
an orthogonal basis of cigcnfunctions for all the Hecke operators, and after 
normalizing this basis is canonical. The old subspace is the image of new 
subspaces of lower levels. 

The canonical basis elements for the new space, and more generally all 
normalized eigenfunctions for the Hecke operators, correspond naturally to 
Dirichlet series having Euler product expansions and satisfying functional 
equations. These Dirichlet series, the L-functions of eigenfunctions, will later 
express the connection between modular forms and elliptic curves. 

Related reading: [Lan76], [Miy89], and Chapter 3 of [Shi73]. 

5.1 The double coset operator 

Let A and _T 2 be congruence subgroups of SL 2 (Z). Then i~i and r 2 are sub- 
groups of GL^Q), the group of 2-by-2 matrices with rational entries and 
positive determinant. For each a e GL^ (Q) the set 
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Aa^ = {7i«72 : 7i € A, 72 € A} 

is a double coset in GL^(Q). Under a definition to be developed in this section, 
such double cosets transform modular forms with respect to A into modular 
forms with respect to A- 

The group A acts on the double coset A a A by left multiplication, par- 
titioning it into orbits. A typical orbit is A/3 with representative [3 — 710172, 
and the orbit space A\AaA is thus a disjoint union 1J A/3j for some choice 
of representatives f3j . The next two lemmas combine to show that this union 
is finite. 

Lemma 5.1.1. Let r be a congruence subgroup o/SL 2 (Z) and let a be an 
element o/GL^(Q). Then a^ 1 Ta n SL 2 (Z) is again a congruence subgroup 
o/SL 2 (Z). 

Proof. There exists N e Z + satisfying the conditions r(N) C A Na <E 
M 2 (Z), Na- 1 e M 2 (Z). Set TV = A 3 . The calculation 

ar(jV)a _1 C a(I + N^iZ^a' 1 

= I + N -Na- M 2 (Z) • Ao^ 1 C / + AM 2 (Z) 

and the observation that ar{N)a~ 1 consists of determinant- 1 matrices com- 
bine to show that ar(N)a~ 1 C r(N). Thus r(N) C ^^(^a C a^Ta, 
and intersecting with SL 2 (Z) completes the proof. □ 

Lemma 5.1.2. Let A and A be congruence subgroups o/SL 2 (Z), and let a 
be an element o/GL^~(Q). Set A = a _1 A« H A, a subgroup of A- TTien 
fe/i multiplication by a, 

A — ► A a A given &y 72 1 ^ 072, 

induces a natural bisection from the coset space A\A ^° ^ e orW space 
A\AaA- -fre concrete terms, {72.^} is a set of coset representatives for A\A 
z/ and onfo/ if {f3j} — {aj2.j} is a set of orbit representatives for A\AaA- 

Proof. The map A — >• A\AoA taking 72 to A 072 clearly surjects. It takes 
elements 72,72 to the same orbit when A 072 = Acr^i i- e -> 72 72 -1 e a 
and of course 7 2 72~ 1 € A as well. So the definition A = or 1 A a H A gives 
a bijection A\A — >■ A\AaA from cosets A72 to orbits A 072- The last 
statement of the lemma follows immediately. □ 

Any two congruence subgroups G\ and G2 of SL 2 (Z) are commensurable, 
meaning that the indices \G\ : G\ n G2} and [G2 ■ G\ n G2] are finite (Exer- 
cise 5.1.2). In particular, since a _1 A a H SL 2 (Z) is a congruence subgroup 
of SL2(Z) by Lemma 5.1.1, the coset space A\A in Lemma 5.1.2 is finite 
and hence so is the orbit space A\AoA- With finitencss of the orbit space 
established, the double coset A«A can act on modular forms. Recall that for 
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(3 e GL^(Q) and fceZ, the weight-k j3 operator on functions / : % — > C is 
given by 

(Mk)(T) = (det/3) fc - 1 j(/3,r)- fc /(/3(r)), r e ft. 

Definition 5.1.3. For congruence subgroups A and A °/ SL 2 (Z) and a G 
GL^(Q), t/ie weight-k AaA operator takes functions f e .Mfc(A) to 

/[Aar 2 ] fc = ^/[/y fc 

where {/3j} are orbit representatives, i.e., AaA = Uj : A/?j * s a disjoint 
union. 

The double coset operator is well defined, i.e., it is independent of how 
the /3j are chosen (Exercise 5.1.3). Seeing that it takes modular forms with 
respect to A to modular forms with respect to A, 

[AqAU : M fc (A) — ► M fc (r 2 ), 

means showing that for each / e _M/c(A), the transformed /[A^jii is A- 
invariant and is holomorphic at the cusps. Seeing that the double coset oper- 
ator takes cusp forms to cusp forms, 

[r ia r 2 ] k ^s k {r 2 ), 

means showing that for each / G <Sfc(A), the transformed /[AaA]fc vanishes 
at the cusps. 

To show invariance, note that any j 2 e A permutes the orbit space 
A\AoA by right multiplication. That is, the map 72 : A\AaA — > 
A\AaA given by h-> A/?72 is well defined and bijective. So if {/3j} 
is a set of orbit representatives for A\AoA then {^72} is a set of orbit 
representatives as well. Thus 

(/[Aar 2 ] fe )[ 72 ] fe = Y,f[PMk = /[A«A] fe , 
j 

and /[AaA]fc is weight-/c invariant under A as claimed. 

To show holomorphy at the cusps, first note that for any / <G Alfc(A) 
and for any 7 <G GL 2 (Q), the function g — f["f]k is holomorphic at infinity, 
meaning it has a Fourier expansion 

g(r) = ]T a n (g)e 2 ^/h 

n>0 

for some period h e Z + (this was Exercise 1.2.11(b)). Second, note that if 
functions gi,...,gd : % — > C are holomorphic at infinity, meaning that each 
gj has a Fourier expansion 
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n>0 

then so is their sum gi + ■ ■ ■ + gd (Exercise 5.1.4). For any 8 £ SL 2 (Z), the 
function (AA a A]fc)[<5]fc is a sum of functions gj = /[7j]fe with jj = (3jS G 
GL^(Q), so it is holomorphic at infinity by the two facts just noted. Since S 
is arbitrary this is the condition for holomorphy at the cusps. 

For any / G <Sfc(_Ti) and for any 7 e GL^(Q), the function g — f[~f]k van- 
ishes at infinity (this was also Exercise 1.2.11(b)), and the previous paragraph 
now shows that / Ai a A]fc £ <Sfc(A), i.e., the double coset operator takes cusp 
forms to cusp forms as claimed. 

Special cases of the double coset operator [Ac^Ajfc arise when 

(1) A D A- Taking a — I makes the double coset operator be /[A«A]fe = /, 
the natural inclusion of the subspacc .Mfc(A) in -A4fc(A) ; an injection. 

(2) a~ 1 Tia = A- Here the double coset operator is /[AaAzjfc = /[o;]fe, the 
natural translation from A4fc(A) to AA(A), an isomorphism. 

(3) A C A- Taking a = I and letting {72,^} be a set of coset representatives 
for A\A makes the double coset operator be /[A«A]fe = ^A" /[72,j]fei 
the natural trace map that projects AA(A) onto its subspace Alfc(A) 
by symmetrizing over the quotient, a surjection. 

In fact, any double coset operator is a composition of these. Given A 5 A, 
and a, set A = a _1 Aa H A as usual and set A = otr^a~ l = A H aAct - • 
Then A 3 A an( ^ a _1 A Q! = A an d A C A, giving the three cases. The 
corresponding composition of double coset operators is 

i 

which by Lemma 5.1.2 is the general [ActAjfc. 

The process of transferring functions forward from A^fe(A) to A^fc(A) 
by the double coset operator also has a geometric interpretation in terms 
of transferring points back between the corresponding modular curves. This 
leads to an algebraic interpretation of the double coset as a homomorphism of 
divisor groups. Recall that every congruence subgroup _T has a modular curve 
X(r) = r\H* consisting of orbits Ar. The configuration of groups is 

A A 

where the group isomorphism is 7 H 017a -1 and the vertical arrows are 
inclusions. The corresponding configuration of modular curves is 
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X 3 



^X> 3 



(5.1) 



where the modular curve isomorphism is I3T r 3 a(r), denoted a (Exer- 
cise 5.1.5). Again letting r 3 \r 2 = (J ■ r 3 j 2 j and (3j — aj 2 j for each j so that 
= Uj each point of X 2 is taken back by 7Ti o a o 71-^ 1 to a set of 

points of X 1; 

{% J M}^{^ j (r)} 



r 2 r 



{A/3,(r)}. 



Here 



takes each point x G A 2 to the multiset (meaning elements can 



repeat) of overlying points y G X 3 each with multiplicity according to its 
ramification degree, 7r^" 1 (x) = {e y ■ y : y G X 3 , n 2 (y) = x}. To place the 
composition in the right environment for counting with multiplicity let Div(A) 
denote the divisor group of any modular curve X, where as in Chapter 3 the 
divisor group of a set is the free Abelian group on its points. In terms of 
divisors the composition is 

[Aar 2 ] fc : X 2 — ► Div(Xi), T 2 r ^ ^ A&M, 



and this has a unique Z-lincar extension to a divisor group homomorphism, 

[Aar 2 ] fe : Div(X 2 ) — > Div(Xr). 

In this context the special cases _Ti D _T 2 , a Ac* = _T 2 , and J\ C A from 
before lead respectively to a surjection, an isomorphism, and an injection 
(Exercise 5.1.6). 

This chapter will focus on double coset operators acting on modular forms, 
but the emphasis later in the book will move toward the divisor group inter- 
pretation. 



Exercises 

5.1.1. In Lemma 5.1.1, need a _1 fa lie in SL 2 (Z)? 

5.1.2. Show that for congruence subgroups G\ and G 2 of SL 2 (Z), the indices 
[G\ : Gi n G 2 ] and [G 2 : G\ n G 2 ] are finite. (A hint for this exercise is at the 
end of the book.) 

5.1.3. Show that the wcight-fc double coset operator [-TioA]fc is independent 
of how the orbit representatives j3j are chosen. (A hint for this exercise is at 
the end of the book.) 
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5.1.4. Show that if gi, : H — > C are holomorphic at infinity, then so is 
their sum gi + ■ ■ ■ + g d . (A hint for this exercise is at the end of the book.) 

5.1.5. Check that the map a : X 3 — > X' 3 given by A T M> r 3 a(r) is well 
defined. 

5.1.6. Show that in the special cases A D A 5 a^ 1 Tia = A 5 and A C A the 
divisor group interpretation of [A<aA]fc gives a surjection, an isomorphism, 
and an injection. 



5.2 The (d) and T p operators 

Recall the congruence subgroups 



To (AO = 



a b 
c d 



G SL 2 (Z) : 



a b 
c d 



0 * 



(mod N) 



and 



A (AO = 



a b 
c d 



G SL 2 (Z) : 



a b 

c d 



1 * 
0 1 



(mod N) 



where as always A^ is a positive integer. The smaller group has more modular 
forms, i.e., A4fc(A(A r )) D A / Jfc(r 0 (A r )). This section introduces two operators 
on the larger vector space Mk(ri(N)). 

The map r 0 (N) — > (Z/NZ)* taking [ a c h d ] to d (mod N) is a surjective 
homomorphism with kernel ^(N). This shows that A (AO is normal in r 0 (N) 
and induces an isomorphism 



ro(N)/r\(N) (Z/NZ)* where 



a b 
c d 



1 y d (mod N). 



To define the first type of Hecke operator, take any a G r 0 (N), set I\ = 
r 2 = A (AO, and consider the weight-/c double coset operator [AaAjfe- Since 
A (AO < A> (AO this operator is case (2) from the list in Section 1, translating 
each function / G A4fc(A(A0) to 

/[A(A r )aA(A r )]fe = f[a]k, a G A(A r ), 

again in A / Jfe(A(A r )). Thus the group A) (AO ac ts on .Mfc(A(A0), and since its 
subgroup A(A0 acts trivially, this is really an action of the quotient (Z/NZ)* . 
The action of a = [ " J ] , determined by d (mod AT) and denoted (d) , is 



(d> : M fc (A(JV)) — ^M fc (A(JV)) 



given by 
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(d).f = .f[a]k for any a 



a b 
c 5 



£ r 0 (N) with S = d (mod N). 



This is the first type of Hecke operator, also called a diamond operator. For 
any character \ : (Z/NZ)* — > C, the space Mk(N,x) from Chapter 4 is 
precisely the x-eigenspace of the diamond operators, 

M k (N, X ) = {.f € M fc (ri(JV)) : (d)/ = for all d G (Z/7VZ)*} . 

That is, the diamond operator (d) respects the decomposition A4k(r\(N)) = 
Mk(N, x), operating on the eigenspace associated to each character x as 

multiplication by x(d). 

The second type of Hecke operator is also a weight-A: double coset operator 
[riar 2 ]k where again I\ = _T 2 = A (AO, but now 



a 



1 0 
0 p 



p prime. 



This operator is denoted T p . Thus 

T p : Mkir^N)) — ► M fe (A(iV)), P prime 

is given by 

1 A(JV)] t . 



T p f = f[r 1 (N) 

From (3.16), the double coset here is 



1 0 
0 p 



A (AO 



1 0 
Op 



A(iV)= t£M 2 (Z) : 7 : 



1 * 
0 p 



(mod AT), det7 = p 



so in fact [J°] can be replaced by any matrix in this double coset in the 
definition of T p . 

The two kinds of Hecke operator commute. To see this, continue to let 
a = [op] and check that -ycr/ -1 = [J *] (mod N) for any 7 G r 0 (A r ). If 

r 1 {N) a r 1 {N) = \Jr 1 {N)p j 

3 

then the last sentence of the preceding paragraph and the fact that J\ (N) is 
normal in Jo (AT) show that the double coset is also 

A(A0<xTi(A0 = r 1 (A^)7a 7 - 1 A(Af) = jr^ar^N)^ 1 

= 1 \Jr 1 (N)p n - 1 = (Jr 1 (AT) 7 /3 i7 - 1 . 



Comparing the two decompositions of the double coset gives (J • J\(A0 7 /3j = 
Uj A(A r )/?j 7 , even though it need not be true that ri(A r ) 7 /3 J - = ^(N)^ 
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for each j. Thus for any / e Mk(ri(N)) and any 7 G r 0 (N) with lower right 
entry 5 = d (mod N), 

(d)T p f = ]T fWnh = ]T fhMk = T p {d)f, f e M fc (A(iv)). 



To find an explicit representation of T p , recall that it is specified by orbit 
representatives for ii\ria!.r 2 , and these are coset representatives for r 3 \r 2 
left multiplied by a, where I3 = a _1 ria n For the particular ri, T"^, and 
a in play here, recall from Exercise 1.5.6 the groups 



r°(p) 



a & 
c d 



G SL 2 (Z) : 



a & 
c d 



* 0 



(mod p) 



and 



r 1 °(JV, P ) = r 1 (iV)nr 0 (p). 

Then _T 3 = r^(N,p) (Exercise 5.2.1). Since r 3 is T 2 subject to the additional 
condition 6 = 0 (mod p), the obvious candidates for coset representatives are 



72,. 



1 j 
0 1 



0 < j < P- 



Given 72 = [° £ T 2 , we have 72 G r 3 7 2J if 7272,] e T 3 = T 2 n r°(p). 
Certainly 727^) G A for any j since 72 and 7 2j - are, but also we need the 
upper right entry b — ja of 727^j = [" d] [0 ~i] t° be 0 (mod p). 

If p \ a then setting j = ba^ 1 (mod p) does the job. But if p | a then 
6 — ja can't be 0 (mod p) for any j, for then p \ b and so p | ad — be = 1. 
Instances of 72 G T 2 with p \ a occur if and only if p \ N, and when this 
happens 72,0, • • ■ >72, P -i fail to represent r 3 \r 2 . To complete the set of coset 
representatives in this case, set 



72, CO 



mi) n 
N 1 



where mp — nN = 1. 



Now given 72 



\ab' 

L c 



G r 2 with p j a, it is easy to show that 7272.^0 *= A 



as needed (Exercise 5.2.2). Thus 72,0, • • • >72 lP -i arc a complete set of coset 
representatives when p \ N, but 72,00 is required as well when p \ N. In any 
case, it is easy to show that the 72 j represent distinct cosets (Exercise 5.2.3). 
The reader may recognize these matrices from Exercise 1.5.6(b). 

The corresponding orbit representatives (3j — aj 2 ,j for 7\\i _ iai~2, needed 
to compute the double coset operator, work out to 



1 J 
0 p 



for 0 < j < p, p o 



m n 




' p 0" 


N p 




0 1 



\ip\N. (5.2) 



This proves 
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Proposition 5.2.1. Let N e Z+, let A = A = A(-/V)> and feia= [J °] 
w/iere p is prime. The operator T p — [AaA]fc on M-k(r\(N)) is given by 



T P f = { 



(p-i 

U'=° 



i/p 1 AT, 



U + /[[iVp] [S?]]* »/Pt^. where mp-nN=l. 



Letting A = A = r 0 (N) instead and keeping a = [op] gives the same 
orbit representatives for A\A a A (Exercise 5.2.4), but in this case the last 
representative can be replaced by (3^ = [ jj ? ] since [ jv p ] € A • 

The next result describes the effect of T p on Fourier coefficients. 

Proposition 5.2.2. Let f e -M fe (A(-/V))- Smce [J }] € A(N), / /ias period i 
and ftence /ias a Fourier expansion 

oo 



n=0 



T/ien: 



(a) Le£ ljv : (Z/NZ)* — > C* be the trivial character modulo N. Then T p f 
has Fourier expansion 

oo oo 

(T p f)(r) = E o n p(/)9 n + M^P*" 1 E M<P>/)<Z np 

n=0 n=0 

oo 

= E + ljv(P)P fc_1 an/ P «P>/))« n - 

71 = 0 

77ia£ is, 

a„(T p /) = a np (f) + l N {p)p k - 1 a n/p {{p).f) for f e M^N)). (5.3) 

(Here a n / p = 0 w/ien n/p ^ N. As in Chapter 4-, 1jv(p) — 1 w/ien p \ N 
and 1n(p) = 0 when p | AT.) 

(b) Let x ■ (Z/NZ)* — > C* be a character. If f € M k (N,x) then also 
T p f g A4k(N, x), and now its Fourier expansion is 



(T P f)(r) = E M/)?" + X(p)p k - 1 E a M)<l 

oo 

= E(s(/) + *)/"V/ P (/))4 n 



71=0 



T/iai is, 

a n (T p f) = a np (f) + x(p)p k ' 1 a n / P (f) for f G M fc (JV, x ). 



(5.4) 
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Proof. For part (a) , take 0 < j < p and compute 

1 00 

i)/p 



f[\H]Ur) =p k -\0r + p)- k f (^±1) = Iga„(/)e 2 ™^) 

1 OO 



P n =o 



where q p = e 27 " T//p and p p = e 2lTt ^ p . Since the geometric sum X^j=o A 4 ? 7 ' is P 
when p | TV and 0 when p \ N, summing over j gives 

p— 1 oo 

E/[[op]u( 7 -)= E = Em/)9" 

j=0 n=0 (p) n=0 

This is (T p /)(r) when p | iV. When p{ AT, (T p /)(r) also includes the term 
/[[^?][S?]]*(r) = ((p>/)[[g;]] fc (r) 

OO 

= p*- 1 ^ + l)- k ((p).f)(pr) = p*" 1 £ a„«p> /)<?"*>. 

n=0 

In either case, the Fourier series of T p / is as claimed. 

The first statement of (b) follows from the relation (d)(T p f) = T p ((d)f). 
Formula (5.4) is immediate from (5.3). □ 

Since double coset operators take cusp forms to cusp forms, the T p op- 
erator restricts to the subspace Sk(r\(N)) of Mk(ri(N)). In particular the 
weight 12 cusp form A, the discriminant function from Chapter 1, is an eigen- 
vector of the T p operators for SL 2 (Z) since <Si2(SL 2 (Z)) is 1-dimensional, 
and similarly for the generator of any other 1-dimensional space such as the 
functions <Pkij) = ri{r) k r](N r) k from Proposition 3.2.2. Part (b) of the propo- 
sition here shows that T p further restricts to the subspace Sk(N, x) for every 
character \ modulo N. 

Eisenstein series are also eigenvectors of the Hecke operators. Recall the 
series E^' v from Chapter 4 for any pair tp, ip of Dirichlet characters modulo u 
and v such that uv \ N and (ip<p)(— 1) = (— l) fe , 

oo 

Et v (r) = S(yj)L(l - k,<p) + 2j2 a t-i(n)l n , 1 = ^ 

n=l 

m = (o otherwise = £ V^/mMm)™*" 1 . 

\ m\n 

m>0 

Recall also the series i?^'*''*(r) = Ef' v (tT) where ip and ip are primitive and 
t is a positive integer such that tuv \ N. From Chapter 4, as (ip, ip, t) runs 
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through a set of such triples such that iptp — \ a t level N these Eisenstein 
series represent a basis of £^{N,x)- (At weight k — 2 when ^ = = li 
the definition is different, E^ 1 ' 11 '*^) = E^ lM (t) - ti^ 1 ' 11 (tr), so the triple 
(li, li, 1) contributes nothing and is excluded.) By definition of A4k{N, x) as 
an cigenspace, (d)E^ v ' t = xWEf'*'* for all d relatively prime to N, but also 

Proposition 5.2.3. Let x modulo N, tp, ip, and t be as above. Let p be prime. 
Excluding the case k = 2, ip = ip = li, 

T p Et^ = m P ) + vWifi- 1 )^"'* ifuv = N or ifp\N. 

Also, 

, , t , , t if t is prime and N is a power of t 

or if p \ N. 

This is a direct calculation (Exercise 5.2.5). When uv < N (excluding the 
special case k = 2, ip — ip = li) the series E^ v,t is "old" at level N in the 
sense that it comes from a lower level, being the series Ef' v at level uv raised 
to level tuv by multiplying the variable by t and then viewed at level N since 
tuv | N. On the other hand, when uv = N (or in the special case if also t 
is prime and N = t) the series is "new" at level N, not arising from a lower 
level. The proposition shows that Eisenstein series at level N are eigenvectors 
for the Hecke operators away from the level, and new Eisenstein series at 
level N are eigenvectors for all the Hecke operators. We will see this same 
phenomenon for cusp forms later in the chapter. 

The Hecke operators commute. 

Proposition 5.2.4. Let d and e be elements of (Z/JVZ)* , and let p and q be 

prime. Then 

(a) (d)T p = T p (d), 

(b) (d)(e) = (e)(d) = (de), 

(c) TpTq — TqTp. 

Proof. Part (a) has already been shown. Since the (d) and T p operators pre- 
serve the decomposition Mk(ri(N)) = Q^Mk(N, x), it suffices to check (b) 
and (c) on an arbitrary / € Mk(N, \). Now (b) is immediate (Exercise 5.2.6). 
As for (c), applying formula (5.4) twice gives 

a n (T p (T q f)) = a np (T q f) + x{p)p k ^a n/p {T q f) 

= a npq (f) + x(q)q k ^ 1 a np /q(f) 

+ x(p)p ~ ( ^riq/p (/) + X(q)q k - 1 ^n/pq (/)) 
= a npg {f) + x(q)q k ^ 1 a np / q (f) + X(p)p k ~ 1 a nq / P (f) 

+ x(pq){pq) ~ (/), 

and this is symmetric in p and q. □ 
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The modular curve interpretation of T p as at the end of Section 5.1 is 



T p : Divp^AT)) — ► Div(Xi(AT)), A(7V)r ^ ]T r^N)^), (5.5) 



with the matrices (3j from (5.2), excluding (3^, when p \ N. (Strictly speaking 
the map on the left is the Z- linear extension of the description on the right, 
whose domain is only X\(N), but we use this notation from now on without 
comment.) There is a corresponding interpretation of T p in terms of the mod- 
uli space Si(iV) from Section 4 of Chapter 1. Recall that Si(7V) consists of 
equivalence classes of enhanced elliptic curves (E, Q) where E is a complex 
elliptic curve and Q is a point of order N. Let Div(Si(AT)) denote its divisor 
group. Then the moduli space interpretation is 



T p :Div(Si(AT))^Div(Si(AO), [E, Q] ^ ^[£/C, Q + C], (5.6) 



where the sum is taken over all order p subgroups C C E such that Cfl (Q) = 
{Oe} and where the square brackets denote equivalence class. To see where 
this comes from, recall from Chapter 1 the lattice A T = rZ © Z and the 
complex elliptic curve E T — C/A T for r e H. Associate to each f3j (including 
/3oo when p \ N) a subgroup C = Cj = cApj T ) of E T where c e C, working 
out to C = ((t + j) jp) + A T for 0 < j < p and to C = (1/p) + A T for j = oo 
(Exercise 5.2.7(a)) and therefore satisfying the conditions 



The groups Co, . . . , C p _i, Coo are subgroups of E T [p] and pairwise disjoint 
except for 0, making their union a subset of E T [p] totaling 1 + (p+ 1) = 

p 2 elements, i.e., their union is all of i£ T [p]- (^ cc Figure 5.1 and Exer- 
cise 5.2.7(b,c).) Any subgroup C of E T satisfying the first condition of (5.7) 
must lie in -E T [p], making it one of the Cj. The group fails the second 
condition of (5.7) when p \ N. Thus the matrices (3 appearing in T p describe 
the subgroups C in the moduli space interpretation (5.6). This discussion 
elaborates on Exercise 1.5.6(a). 

The moduli space Si (AT) is in bijective correspondence with the noncom- 
pact modular curve Yi(N) — r 1 (N)\H by Theorem 1.5.1. The relation be- 
tween descriptions (5.5) and (5.6) of T p is summarized in the following com- 
mutative diagram (Exercise 5.2.7(d)): 



C 



C Z/pZ in E T , Cn ((1/JV) + A T ) = {0} in E T . 



(5.7) 



P(Si(AT)) 



>X>(Si(JV)) 



(5.8) 



V{Y 1 {N)) 



^V(Y 1 (N)). 



Here the vertical map ipi is the bijection from Theorem 1.5.1. Elcmcntwisc 
the mappings are 
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0 1 
Figure 5.1. E T [5] as a union of six 5-cyclic subgroups 



fi(JV)rh 



->E,-A(^(r). 



When p { N the term [E T /Coo, + C^] works out to [_E pr , + yl pr ] (Exer- 
cise 5.2.7(e)). 

Similarly and more easily for the diamond operator, there is a commutative 
diagram 



Si(A0 

i/>i 



(d) 



>Si(iV) 

■01 



(5.9) 



yi(jv)-^ii(JV), 



where if a = [" |] e r 0 (N) with (5 = d (mod iV) then the mappings are given 

by 



A(7V)rh 



->A(iV)a(T). 



This was Exercise 1.5.3. Naturally, (d) extends Z-linearly to divisors. 

To summarize, we have four compatible notions of the Hecke operator T p , 
starting from the double coset A (N) [JJ] A (AT) . This works out to 



7 e M 2 (Z) : 7 = 



1 * 
Op 



(mod N), det-f = p 



similar to the definitions of congruence subgroups by conditions on integer 
matrices. The double coset gives the second version of T p , a linear operator 
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on the space of modular forms Mk(Ti(N)), 

3 

where A (AT) [J °] A (TV) = (Jj r i{ N )P 3 - Thu s T p /(r) evaluates / at a set of 
points associated to r. In terms of points themselves, a third version of T p is 
the endomorphism of the divisor group of the modular curve X\ (N) induced 

by 

3 

By the general configuration of the double coset operator from Section 5.1, 
this lifts each point of Xi(N) to its overlying points on the modular curve 
X 3 = X%(N,p) of the group A = rf(N,p) = A (AT) n r°(p), translates them 
to another modular curve X 3 over Xi(N) by dividing them by p (since the 
definition uses the matrix [JJ]), and then projects them back down. That is, 
T p factors as 

T p : A(A0t ^ ]T A 72j (r) ^A^jW/p, 

3 3 

where [op]72,j = Pj for each j (see Exercise 5.2.9). The net effect is to 
take a point at level A^ to a formal sum of level A" points associated to it 
in a manner depending on p. Section 6.3 will revisit this description of T p , 
and Exercise 7.9.3 will show that X' 3 is the modular curve Xi t o(N,p) of the 
group A,o(A r ,p) = A (AT) n r 0 (Np). Fourth, T p is an endomorphism of the 
divisor group of the moduli space Si(A f ), induced by 

T p : [E,Q] ^ ^[A/C,Q + C]. 

c 

Similarly to the third version (recall (1.11) and see Exercise 5.2.9), this factors 
as 

T P : [E, Q] i y ]T[A C, Q] h+ ^[S/C, Q + C}. 

c c 

Exercises 

5.2.1. Show that when A = A = A (AT) and a = [J°], the group T 3 = 
a -1 A a n A works out to lf(N,p). 

5.2.2. When 72 = [" |j] w ^ n P I a ' snow that 72 7 2~ (X) e A as needed. 

5.2.3. Show that the represent distinct cosets. 

5.2.4. Show that letting A = A = T a (N) and keeping a = [J °] gives the 
same orbit representatives for A\A«A- 
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5.2.5. This exercise proves Proposition 5.2.3. 

(a) Show that the generalized divisor sum ct^I^ is multiplicative, i.e., 

&t-i( nm ) = a fe-^( n ) cr fe-^i( m ) wnen gcd(n,m) = 1. 

(b) Let p be prime and let n > 1. Write n — n'p e with p \ n' and e > 0. 
Use part (a) to show that 

at\{np) = ^{p)*t\{n) + 

Also use part (a) to show that when e > 1 and p \ N, 

X {p)p k ~^t\{n/p) = V {p)p k -^t\{n) - V {p^){p^) k -^t\{n'). 

Use the two formulas to show that for all e > 0 and p \ N, 

4-i^p) + x{p)P k - 1 4-M/p) = + p(p)p*- VjfiW 

Show that this formula also holds for all e > 0 and p\uv. 

(c) Excluding the case k = 2, ip = ip = li, use part (b) and formula (5.4) 
to show that if uv = N or p \ N then 

a n (T p Et^ 1 ) - OMp) + ^(f,)/- 1 )^^''), n > 1. 

Show the same result for n = 0. (Hints for this exercise are at the end of the 
book.) 

(d) Complete the proof by computing a^E^ 1 ' 11 '*) and a n {T p E2 1,ll,t ) for 
all n. 

5.2.6. Verify part (b) of Proposition 5.2.4. 

5.2.7. (a) Show that for each r € % and each matrix (ij — q p where 

0 < j < P, the group C = ^/3,(r) is (( T + j)/p) + ^r- Show that this group 
satisfies conditions (5.7). If p \ N show that for each r £ T-L and the matrix 
/?co - [ft p] [I ?], the group C - = (TVr + 1)A 0oo{t) is + A T and 

this group satisfies conditions (5.7). (A hint for this exercise is at the end of 
the book.) 

(b) How do the subgroups of E T [5] in Figure 5.1 correspond to the matrices 

00,..., 06, Poo? 

(c) As an alternative to the counting argument in the section, consider any 
subgroup C C E T satisfying (5.7). By the first condition [E T [p] : C] — p and 
so a basis of C is 



0 



, /3SM 2 (Z), det(0)=p. 



Two such matrices 0 and 0' specify the same group exactly when 0' = j0 for 
some 7 € SL2(Z). If 0 = [" J}] then gcd(a, c) is 1 or p. Show that repeated left 
multiplication by elements of SL 2 (Z) to carry out the Euclidean algorithm on 
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a and c and then further left multiplication if necessary reduce j3 to exactly 
one of A), • • ■ , Pp-i, Poo- 

(d) Show diagram (5.9) commutes by showing that tpi([E/Cj, 1/N+Cj]) = 
ri(N)(3j(r) for all j, including j = oo when p \ N . 

(e) Show that [Er/C^, ± + C^] = [E pT , & + A pr ] when p \ N. 

5.2.8. Evaluate T 3 (SL 2 (Z)i) e Div(SL 2 (Z)\%*), viewing T 3 in the sense 
of (5.5) and keeping an eye out for ramification. (A hint for this exercise 
is at the end of the book.) 

5.2.9. Specializing the general double coset operator to the data for T p , i.e., 
A = r 2 = A (TV) and a = [hp], so that A = r?(N,p), the corresponding 
modular curve X 3 is denoted X®(N,p). Show that in the configuration (5.1) 
the maps 7r 2 and 7Ti o a from X°(AT,p) to Xi(N) are 

7r 2 (rf (JV,p)r) = A(7V)r, (tt! o a)(Jf (JV,p)r) = A(JV)t/ P . 

The moduli space versions of these maps were given in (1.11) and derived in 
Exercise 1.5.6. Show that their descriptions and the commutative diagram 

Si(JV) < S?(7V,p) ► Si(JV) 



V-i 



Xl (N) 4^- X°(N,p) ^ X^N) 
give another derivation of (5.6). 

5.3 The (n) and T n operators 

So far the Hecke operators id) and T p are defined for d <G (Z/NZ)* and p 
prime. This section extends the definitions to (n) and T n for all n € Z+. 

For n € Z + with (n, iV) = 1, (n) is determined by n (mod AT). For n € Z + 
with (n,N) > 1, define (n) = 0, the zero operator on .Mfc(A(-W)). The 
mapping n h->- (n) is totally multiplicative, i.e., (ran) = (n)(m) for all n, m e 
Z+. 

To define T„, set A = 1 (the identity operator); T p is already defined for 
primes p. For prime powers, define inductively 

T p r=T p T pr -i-p k - 1 {p)T p r-2, {ovr>2, (5.10) 

and note that inductively on r and s starting from Proposition 5.2.4(c), 
T p T-T q s — T q sTpr- for distinct primes p and q. Extend the definition multi- 
plicative ly to T n for all n, 



T„ = Y[ Tp'* where n = J| p\ % , 



(5.11) 
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so that the T n all commute by Proposition 5.2.4 and 

T nm = T n T m if (n, m) = 1. 

Exercise 5.3.1 gives a more direct description of T n on A4fc(-To(./V)). To moti- 
vate the mysterious-looking prime power definition, let g(s) be a generating 
function of the T n , 

oo 
n=l 

Then definitions (5.10) and (5.11), determining all the T„ in terms of T p 
and (p), are encapsulated as a product expression for g (Exercise 5.3.2), 

9(s) = T p p- S + (p)^- 1 " 2 ')- 1 , (5.12) 

p 

the product taken over all primes. This idea will be pursued further in Sec- 
tion 5.9. 

The Fourier coefficient formulas in Proposition 5.2.2 generalize to 
Proposition 5.3.1. Let f e Mk(Fi(N)) have Fourier expansion 

oo 

f(r)=J2 a m(f)q m where q = e 2 ™ . 
Then for all n € Z+, T n f has Fourier expansion 

oo 

(Tn.f)(r) = a m(T n f)q m , 

where _ 

a m (T n f)= J2 d^a^/^dd)/). (5.13) 

d|(m,n) 

In particular, if f € Mk(N, \) then 

a m {T n f)= Y, X(d)d k - 1 a mn/d ,(f). (5.14) 

d\(m,n) 

Proof. As usual, we may take / € M^{N, x) an d thus formula (5.13) reduces 
to (5.14). Formula (5.14) checks trivially when n = 1 and it reduces to (5.4) 
when n — p is prime (Exercise 5.3.3). Let r > 2 and assume (5.14) holds for 
n = l,p,p 2 , . . . ,p r_1 . By definition of T p r and formula (5.4), 

a m (T p rf) = a m {T p {T pr -,f)) - /- 1 a m ((p)(T pr _ 2 /)) 
= a mp (T pr -if) + x(p)p k ~ 1 a m / P (T p r-if) 
- X (p)p k - 1 a m (T pr - 2 f). 
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By induction on r this is 

a m (T p rf) = ^2 X(d)d k ' 1 a mpr/d 2(f) 

rf|(mp,p r_1 ) 

+ x(p)p k ~ 1 xM^^Vp-v^t/) 

d\ (m,p r ~ 2 ) 

The first term is 

(/)+ E 

®"mp r /d 2 (/), 

rf|(mp,p r_1 ) 
<i> 1 

and the sum here cancels the third term. Thus 

a m {T p rf) = a mp r(f) + x(p)p k ~ 1 E x{d)d k - 1 a mpr - Vd 2{f). 

A little inspection shows that this regroups to the desired formula (5.14) with 
n = p r (Exercise 5.3.4). 

Finally, take ni,n 2 £ Z + with (ni,n 2 ) = 1. Then 

a m (T ni (T n J)) = E x(rf)^" 1 a ro „ l/d 2(T„ 2 /) 

rf|(m,ni) 

= E X(rf)rf fe_1 Yl X(e)e k - 1 a mnin2/d 2 e 2(f) 7 

d\(m.n-i) e| (mn-i/d 2 .JI2) 

and this regroups to formula (5.14) (Exercise 5.3.4 again). □ 
Exercises 

5.3.1. Let / £ M k (r 0 (N)). For each n £ Z+ define 

S„ = {deZ+:d|n,gcd(n/d,AO = l}, M n = \J \J 

des n j=o 

Show that T n f = J2 jeMn f[j]k- (A hint for this exercise is at the end of the 
book.) 

5.3.2. Show that definitions (5.10) and (5.11) imply relation (5.12). 

5.3.3. Check that Proposition 5.3.1 reduces to formula (5.4) when n = p is 
prime. 

5.3.4. Convince yourself that the two expressions computed in the proof of 
Proposition 5.3.1 regroup to formula (5.14) as claimed. 



n/d j 
0 d 
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5.4 The Petersson inner product 

To study the space of cusp forms Sfc(Ti(iV)) further, we make it into an inner 
product space. The inner product will be defined as an integral. The first 
few results will establish that the integral in question converges and is well 
defined. It docs not converge on the larger space Mk(ri(N)), so the inner 
product structure is restricted to the cusp forms. 

Define the hyperbolic measure on the upper half plane, 

dxdy 

dfifr) — — t — x + ty e ft. 

y 

This is invariant under the automorphism group GL^(R) of "H, meaning 
d^{a{r)) — dfi(r) for all a e GL^(R) and t e % (Exercise 5.4.1(a)), and 
thus in particular d/i is SL 2 (Z)-invariant. Since the set Q U {oo} is countable 
it has measure zero, and so d[i suffices for integrating over the extended upper 
half plane %* . Recall from Chapter 2 that a fundamental domain of %* under 
the action of SL 2 (Z) is 

V* = {t e U : Re(r) < 1/2, |r| > 1} U {oo}. 

That is, every point r' G % transforms under SL 2 (Z) into the connected 
set T>, and barring certain cases on the boundary of T> the transformation is 
unique; and every point ,s e QU {oo} transforms under SL 2 (Z) to oo. For any 
continuous, bounded function ip : % — > C and any a e SL 2 (Z), the integral 
f v „ <^(a(r))d^(r) converges (Exercise 5.4.1(b)). 

Let r C SL 2 (Z) be a congruence subgroup and let {<x, } C SL 2 (Z) repre- 
sent the coset space {±/}r , \SL 2 (Z), meaning that the union 

sl 2 (z) = \J{±i}r aj 

3 

is disjoint. If the function ip is r'-invariant then the sum <p(aj(r))dii(T) 
is independent of the choice of coset representatives otj. Since d\i is SL 2 (Z)- 
invariant the sum is Jy Q .( X ,»- ) < p(' r )^M( T )- And since {Jaj(T>*) represents the 
modular curve X(r) up to some boundary identification (cf. the end of Sec- 
tion 2.3), this quantity is naturally denoted J x ^ r y Thus we have made the 
definition 

/ <P{r)dn{T) = ^(t)^(t) = V/ ip(aj(T))dn(T). 

Jx(r) AJM£>*) j Jv ' 

In particular, setting <p = 1, the volume of X(r) is 



V r = [ dn(r). 
Jx(r) 
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The volume and index of a congruence subgroup are related by 

V r = [SL 2 (Z) : {±I}r} V SJj2{z) . (5.15) 
To construct the Petersson inner product, take any /, g £ S k (r) and let 
f(r) = f(T)gJ^)(lm(r)) k for r e H. 
This is clearly continuous. It is invariant because for any 7 e r, 
f(l(r)) = /( 7 (r))^F))(Im( 7 (T))) fc 

= (.f[7] fc )(r)j(7,r)' £ ( 5 [7] fe )(r)j(7,T)' £ (Im(r)) fe | J (7,r)r 2fe 
= (fh] k )(r)WlkW)(Mr)) k 

= <p(r) since / and g are weight-fc invariant under T. 

With _T-invariance established, showing that <p is bounded on H reduces to 
showing that ip is bounded on 1J ai(T>), and since the union is finite it suffices 
to show that for any a G SL 2 (Z), ip o a is bounded on T>. Being continuous, 
ip o a is certainly bounded on any compact subset of T>. As for neighborhoods 
{Im(r) > y} of zoo, first note the Fourier expansions 

00 00 
(/[«]*) to = 5>„(/[a] fe )C ( 5 [a] fc )(r) = £ 

n— 1 n— 1 

where <ft = e 2mT / h for some /1 e Z+. Each of these is of the order qh (written 
0{qu)) as Im(r) — > 00. Thus as above, 

p(a(r)) - (/H/ £ )(r)( 5 H/ £ )(r)(Im(r)) fe = 0( % ) 2 (lm(r)) fe 

by the Fourier expansions. Since \qu\ = e - 27rIm ( T )/ /l and exponential decay 
dominates polynomial growth, ip(a(r)) — > 0 as Im(r) — > 00 and pa is 
bounded on V as desired. This shows that in the next definition the integral 
is well defined and convergent. 

Definition 5.4.1. Let r C SL 2 (Z) be a congruence subgroup. The Petersson 
inner product, 

(,) r -.s k (r)xs k (r)^c, 

is given by 

(f,9)r = ±- I f(r)g(V)(Im(r)) k d^r). 
v r Jx{r) 

Clearly this product is linear in /, conjugate linear in g, Hcrmitian- 
symmetric, and positive definite. The normalizing factor 1/Vr ensures that if 
r' C r then ( , ) r , = ( , ) r on S k (r) (Exercise 5.4.3). When the subgroup r 
is clear from context, the inner product is written ( , ) without the subscript. 
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Inspecting the argument that the integral defining the Petersson inner 
product converges shows that in fact this holds so long as the product fg 
vanishes at each cusp. In particular it holds when only one of / and g is 
a cusp form. Phrasing Definition 5.4.1 and the various results in the next 
section to encompass this additional generality is a bit cumbersome, so we 
don't bother since our main objects of study via the Petersson inner product 
are cusp forms. At the end of this chapter, however, we will revisit Eiscnstein 
series briefly, showing among other things that the Petersson inner product of 
an Eisenstein series and a cusp form is always 0. That is, Eisenstein series and 
cusp forms are in some sense orthogonal, but this statement is an abuse of 
language since the Petersson inner product does not converge on all of Aik(r), 
possibly diverging for two noncusp forms. 

Exercises 

5.4.1. (a) Show that the hyperbolic measure, d^(r) = dxdy/y 2 for r = x + 
iy € H, is GL^ (R)-invariant. (A hint for this exercise is at the end of the 
book.) 

(b) Let ip : % — > C be continuous and bounded, and let a € SL 2 (Z). 
Show that the integral J v , <p(a(T))dfj,(T) converges. 

5.4.2. What is V SL2(Z )? 

5.4.3. Show that if i~" C r C SL 2 (Z) arc congruence subgroups then ( , )r' = 
( , )r on Sk(r). (A hint for this exercise is at the end of the book.) 

5.4.4. Let FcTc SL 2 (Z) be congruence subgroups with —I e J". Suppose 
that / e Sfe(r) C S k (r') and that g e S k (r'). Letting T = [j^'a,, recall 
the trace of g from Section 5.1, tr g = ^[a^fe € Sk(r). Show that 

Vr>{f,g)r>=Vr{f,tig) r . 
(A hint for this exercise is at the end of the book.) 

5.5 Adjoints of the Hecke Operators 

Recall that if V is an inner product space and T is a linear operator on V, 
then the adjoint T* is the linear operator on V defined by the condition 

(Tv, w) = (v, T*w) for all v, w e V. 

Recall also that the operator T is called normal when it commutes with its 
adjoint. 

Continuing to work in the space Sk(r\(N)) endowed with the Petersson 
inner product, the next task is to show that the Hecke operators (n) and 
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T n for n relatively prime to N are normal. To do so, we need to compute 
their adjoints. This first requires a few technical points. If r C SL 2 (Z) is a 
congruence subgroup and SL 2 (Z) = \J-{±I}raj and a e GL^ (Q) then the 
map % — > % given byrH a(r) induces a bijection a~ 1 ra\'H* — > X(r). 
Thus the union (J^ a~ 1 otj(T>*) represents the quotient space a~ 1 ra\H* up to 
some boundary identification. Analogously to the last section, for continuous, 
bounded, a^ 1 /^-invariant functions ip : H — > C define 

Ja- 1 ra\'H' j JV 

Lemma 5.5.1. Let r C SL 2 (Z) be a congruence subgroup, and let a G 
GL+(Q). 

(a) If ip :rl — > C is continuous, bounded, and F '-invariant, then 

\ ip(a(T))dfi(T) = / ^(t)g^(t). 

Ju- 1 ra\u-' Jx(r) 

(b) Ifa^Ta C SL 2 (Z) then V a -i Fa = V r and [SL 2 (Z) : a^Ta] = [SL 2 (Z) : 

n 

(c) There exist /?!,...,/?„ e GL+(Q), where n = [r : a^Ta n T] = [r : 
a.FoT x n r], such that 

r a r = \Jrp j = \Jp j r, 

with both unions disjoint. 

Proof. Part (a) is immediate. 

The first formula in (b) follows from (a) and the definition of volume. The 
rest follows from formula (5.15) and the fact that —I e a~ 1 Ta -<==^> —I e r. 

For (c), apply (b) with aFoT 1 n F in place of F to get 

[SL 2 (Z) : oT x Fol flT] = [SL 2 (Z) : aFaT 1 n T], 

and the same formula holds with F replacing SL 2 (Z). Thus there exist 
71 , . . . , 7„ and 71 , . . . , 7„ in F such that 

f = [jia^ra n r) 7j = (J^rc*- 1 n r)^ 1 , 

both unions disjoint. Recall that Lemma 5.1.2 relates coset representatives in 
the quotient space to orbit representatives in the double coset. Setting /"i and 
r 2 in the lemma to r here, and setting a in the lemma first to a here and 
then to a -1 here gives 



rar = (J rajj and fa" 1 / 1 = |J fa" 1 ^ 1 , 
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both unions disjoint. The second formula is also _Ta_T = {J-jjOtr. For each 
j, the left and right cosets Fa^j and "/jaT have nonempty intersection, for 
otherwise rccyj C Ui^j 7i a -^i an d multiplying from the right by r gives 
rar C [Ji^j JiO>r, a contradiction. So for each j, pick some /3j € Ta-fjCi-yjar . 
Then Tar = [J r/% = (J /^T as desired. □ 

With the technicalities established, the next proposition shows how to 
compute adjoints. 

Proposition 5.5.2. Let r C SL 2 (Z) be a congruence subgroup, and let a e 
GL+(Q). Set a' = det^cT 1 . Then 

(a) If a' 1 Tot C SL 2 (Z) then for all f e S k {r) and g e S k (a- l ra), 

(f[a]k,g) a -ir a = (f,9[a']k)r- 

(b) For allf,geS k (r), 

(f[rar} k ,g) = (f, g [ra / r} k ). 

In particular, if a Ta = r then [a]* k — [a'] k , and in any case [Tar]* k = 
[raT] k . 

Proof. For part (a), expand the [a] k operator, note that a' acts as a -1 on H*, 
and apply Lemma 5.5.1(a) to get 

/ (f[a] k )(r)glV)(lm(T)) k dn(r) 
Ja- 1 ru\w 

(det a) k -\f{a{rm ai T)- fe ^)(Im(r)) fe ^(r) 

a- 1 ra\W 

= f (det a ) fe -V(r)jXa,a'(r))- fc ^ 7 (7))(Im( a '(T))) fe d M (r). 

Jx(n 

The cocycle condition j(aa',r) — j(a, a'(r))j(a', t), the upper half plane 
identity Im(a'(T)) — (det a')Im(r)|j(a', r)| -2 , and the observation det a' = 
det a reduce the integral to 

f(r)(g[a'] k )(r){lMr)) k d^r). 

x(r) 

Since also V a -i ra = Vp by Lemma 5.5.1(b), the result follows. 

For part (b), the relation Tar — [_] r@j from Lemma 5.5.1(c) shows that 
the {{3j} can serve in Definition 5.1.3 of the operator [ra>r] k . And the relation 
Tar = {Jfyr gives raT = \J r P'j where P'j = det(/?)/r\ showing that 
the {(3 j} can serve in the definition of [raT] k . Now the result is immediate 
from (a) with each r n PjTPj 1 in place of r, 
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(f[rar] k ,g) r = ^(f[0j]k,9)pyi r p jn r 

3 

= E^'5[/3-] fe )rn ft r /3 - 1 = (f,g[raT] k ) r . 

3 

□ 

The adjoints of the Hecke operators follow from the proposition. 

Theorem 5.5.3. In the inner product space Sk(Ti(N)) , the Hecke operators 
(p) and T p for p\ N have adjoints 

{p)* = {p)- 1 and Ip = <!>>-% 

Thus the Hecke operators (n) and T n for n relatively prime to N are normal. 

Proof. Take any f,ge 5 fc (A(iV)). Recall that A (AT) is normal in r 0 (N), so 
Proposition 5.5.2(a) gives 

(p)* = [a]* k for any a G r 0 (N) such that a = [op] (mod N) 

= [a-% = (p)- 1 . 

For the second formula, Proposition 5.5.2(b) gives 

T* p = [A (TV) [Jp] A(JV)f t = [A (AO [g °] 

To study this double coset, note that for m and n with mp — nN = 1, the 

matrix in its middle satisfies [ g ° ] = [ at mp ] [ o p ] [ iv m 1 ■ The ^ rs ^ ma trix 
of this triple product lies in A (TV), the third in r 0 (N). Thus, substituting the 
triple product for [g °] and recalling again that A (TV) is normal in r 0 (N) 
shows that 

A (AO [g ?] A(iV) = A (AT) [J p] A (AT) [£ . 

Now if A (AT) [o°] A (AT) = UA(AT)/3j is the decomposition of the dou- 
ble coset describing T p , then the display shows that A (AO [g ?] A (A 7 ") = 
(J A(A r )/3j [ jy ^] is the decomposition for Tp*. Since to = (mod A 7 ), the 
result A* = (p) -1 T p follows. □ 

From the Spectral Theorem of linear algebra, given a commuting family of 
normal operators on a finite-dimensional inner product space, the space has 
an orthogonal basis of simultaneous eigenvectors for the operators. Since each 
such vector is a modular form we say eigenform instead, and the result is 

Theorem 5.5.4. The space <Sfc(A(A0) has an orthogonal basis of simultane- 
ous eigenforms for the Hecke operators {(n),T n : (n,N) = 1}. 

The next few sections will partly eliminate the restriction that (n, N) = 1. 
Note that when (n,N) > 1, we have (n)* =0* =0 (the zero operator). 
And Exercise 5.5.1 shows that for any Hecke operator T = T n or T = (n), 
whether n shares factors with N or not, 

T* = wnTw^ 1 where wjy is the operator [[jv _ o]]fc- (5.16) 
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Exercise 

5.5.1. (a) Let 7 = [^ ~o] ■ Establish the normalization formula 





a b 




d —c 


7- 1 


Ned 


7 = 


-Nb a 



Use this to show that 7 1 Ti(N) r Y = A (A 7 ), and so the operator wjy — [y]k 
is the double coset operator [A(A07A(A0]fc on ,S fe (A(A0). Show that 
WNinjw^ 1 — (n)* for all n such that (n, N) — 1 and thus for all n. 

(b) Let A (AT) [JO] A (A 7 ") = UAWft be a distinct union, i.e., T p = 
X^[/3j]fe- From part (a), -fT^N) = A (-^07 and similarly for -f^ 1 . Use 
this and the formula [q°J = 7 _1 [Jp] 7 to find coset representatives for 
ri(N) [qO] A (A 7 ). Use Proposition 5.5.2 and the coset representatives to 
show that T* = w^T p w^ 1 , and so T* = wiqT n w^ 1 for all n. 

(c) Show that w* N = (—l) k WN and that i k WNT n is self-adjoint. (A hint for 
this exercise is at the end of the book.) 



5.6 Oldforms and Newforms 

So far the theory has all taken place at one generic level N. This section begins 
results that move between levels, taking forms from lower levels M \ N up to 
level N, mostly with M = Np -1 where p is some prime factor of N. 

The most trivial way to move between levels is to observe that if M | N 
then 5 fc (A(M)) C S fe (A(A0) (Exercise 5.6.1). 

Another way to embed <Sfc(A(-W)) into <Sfc(A(A/")) is by composing with 
the multiply-by-d map where d is any factor of N/M. For any such d, let 



ad = 



dO 
0 1 



so that (/[a d ] fc )(r) = d k -\f(dT) for / : H — > C. By Exercise 1.2.11, the 
injective linear map [ad]k takes Sk(r\(M)) to <Sfc(A(A0), lifting the level 
from M to N. 

Combining the observations so far, it is natural to distinguish the part of 
<Sfe(A(A r )) coming from lower levels. 

Definition 5.6.1. For each divisor d of N , let id be the map 

i d : (Skir^Nd- 1 ))) 2 — ► 5 fe (A(A0) 

given by 

if, 9) !-> f + g[<xd]k- 

The subspace of oldforms at level N is 
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5 fc (A(iV)) old = £ ip ((s k (r 1 (N P - 1 ))f) 

p\N 
prime 

and the subspace of new forms at level N is the orthogonal complement with 
respect to the Petersson inner product, 

5 fe (A(iV)) now - (S fc (A(A0) ol T- 

Exercise 5.6.2 shows that changing the definition of oldforms to sum over all 
divisors of N, rather than just the prime divisors, makes no difference. For 
newspace dimension formulas see [Mar]. The term "newform" will be given a 
more specific meaning in Definition 5.8.1 to follow. 

The Hecke operators respect the decomposition of <Sfc(A(-W)) into old and 
new. 

Proposition 5.6.2. The subspaces S fe (A(A0)° ld and S fc (A(A0) now are sta- 
ble under the Hecke operators T n and (n) for all n e Z+. 

Proof. Let p \ N. The argument breaks into cases. First, let T = (d) with 
(d, N) = 1 or let T = T p > for p' a prime other than p. Then the diagram 

SfctAtJVp- 1 )) 2 — [ "^ ] ) SfcCACJVp- 1 )) 2 

Zp lp 

5 fc (A(JV)) 7 >S fc (A(A0) 

commutes (Exercise 5.6.3(a)); note that the operator T means one thing at 
level Np -1 in the top line of the diagram and something else at level N in 
the bottom line. 

Second, continuing to take p | N, the diagram 

r T pP *-n 

gfcCACJVp- 1 )) 2 L " <P> ° J > ^(A(^" 1 )) 2 

Zp £p 

S k ( A (AO ) > 5fc(A(A0) 

commutes as well (Exercise 5.6.3(b)). The two diagrams combine to show that 
S k (r 1 (N)) old is stable under all T n and (n) (Exercise 5.6.3(c)). 

To establish the result for 5fc(A(A r )) new it is enough to show that 
Sk(ri(N)) old is stable under the adjoints of T n and (n) for all n (Exer- 
cise 5.6.3(d)). Since T* = (n)" 1 ^ and (n)* = (n)" 1 when (n, N) = 1, and 
since (n)* = 0 when (n,N) > 1, the result is clear in these cases. But dis- 
cussing T* when (n,N) > 1 requires Exercise 5.5.1: T* = wTntu^ 1 where 
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w = [[_ N o]]fe- Thus it suffices to show that the oldforms are preserved under 
the injective linear map w. This follows from the commutative diagram 

ro P k - 2 wi 

SkWNp-i)) 2 lw ° > 5fc(A(jVp~ 1 )) 2 



5 fe (A ( TV) ) - > S k (A (TV) ) 

(Exercise 5.6.3(e); note that like the Hecke operators, the w operator means 
different things at different levels: in the top row of the diagram it is w = 

fc). □ 



w N 



0 1 

-Np- 1 0 



Corollary 5.6.3. The spaces S fe (A(TV)) old and S fe (A(TV)) now ftaue orthogo- 
nal bases of eigenforms for the Hecke operators away from the level, {T n , (n) : 
(n,TV) = l}. ' 

As we will see, the condition (n, TV) = 1 can be removed for the newforms. 



Exercises 

5.6.1. Show that if M \ N then S^AfM)) C 5 fe (A(^))- 

5.6.2. If pd | iV then what relation holds between i p d(Sk{ri(N p^ 1 dr 1 )) 2 ) and 
i p (5 fe (A(iVp- 1 )) x 2 d (5 fc (A(A r ^ 1 ^ 1 )) 2 ))? (Here i pd is from level Np- 1 ^ 1 
to level N, i p is from level Np^ 1 to level TV, and id is from level Np^ 1 ^ 1 to 
level Np^ 1 .) Show that changing the definition of 5fe(A(^V)) old to sum over 
all divisors of TV doesn't add anything to the space of oldforms. 

5.6.3. (a) Show that the first diagram commutes. (Hints for this exercise are 
at the end of the book.) 

(b) Show that the second diagram commutes. 

(c) Explain how the two diagrams show that 5 fe (A(^V)) old is stable under 
all T n and (n). 

(d) How does showing that <Sfc(A(TV)) old is also stable under the adjoints 
of T n and (n) for all n establish the rest of the proposition? 

(e) Show that the third diagram commutes. 



5.7 The Main Lemma 

Let M | TV and let d \ (TV/M), d > 1. Thus A(M) D A(^0- 

From the previous section, recall two maps from <Sfc(A(-M)) to <Sfc(A(-W)): 
inclusion and the weight- k operator [ajfc, where ad = [q ?] so that the opera- 
tor is — up to a scalar multiple — composition with the multiply-by-d map. To 
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normalize the scalar to 1, define a variant id of the map id from the previous 
section, 

i d = d^Mfc : 5 fc (A(M)) — ► Sk^N)), (i d f)(r) = f(dr), 
acting on Fourier expansions as 

oo oo 

id ■ a "1 n ^ a nq dn , where q = e 2mT . 

n—1 n—1 

This shows that if / € Sk(r\(N)) takes the form / = J2 p \n L pfp with each 
f p G Sk(ri(N/p)), and if the Fourier expansion of / is /(r) = ^a n {f)q n , 
then a n (f) = 0 for all n such that (n, iV) = 1. The main lemma in the theory 
of newforms is that the converse holds as well. 

Theorem 5.7.1 (Main Lemma). If f £ 5fe(/i(iV)) has Fourier expansion 
/(r) = ^ a n (f)q n with a n (f) = 0 whenever (n, N) — 1, then f takes the form 
f = Y. p \n hip with each fp e Ski^iN/p)). 

The Main Lemma is due to Atkin and Lehner [AL70]. The elegant proof 
presented here is due to David Carlton [Car99, CarOl], using some basic re- 
sults about group representations and tensor products. Explaining these from 
scratch would take the exposition too far afield, so the reader should consult 
a text such as [FH91] (Chapters 1 and 2) as necessary. A simpler instance of 
the tensor product will be presented in Chapter 6. An elementary proof of the 
Main Lemma is in [Lan76], and see also [Miy89]. 

The first step is to change congruence subgroups, moving the congruence 
conditions from the lower left matrix entry to the upper right one. This sim- 
plifies the Main Lemma by converting the map i p to inclusion. Analogously 
to ri(N), define for any positive integer N 



r x (M) 



a b 
c d 



€ SL 2 (Z) : 



a b 
c d 



1 0 

* 1 



(mod N) 



Lemma 5.7.2. aMTi{M)a M = ^(M), and the same formula holds with M 
replaced by N. 

The proof is Exercise 5.7.1. 

Thus the two maps M k ~ 1 [a'^]k : 5 fe (A(M)) — ► S k {r 1 (M)) and 
N^i^N^k ■ 5fe(A(7V)) — > S k {r 1 {N)) are isomorphisms. The first map 
takes /(t) to /(r/M) and thus in terms of Fourier expansions it takes ^ a n q n 
to ^2a n q1f where qu = e 2mT ^ M = q x l M . Similarly for the second map with 
N replacing M. 

Since r\M) D A (A), we have ^(/^(M)) C S k (r 1 (N)), so inclusion 
makes sense as the bottom row in the (not necessarily commutative) diagram 
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5 fe (A(M)) 



->S fc (A(AO) 



s k (r\M))^s k (r\N)). 

In terms of Fourier expansions, the top row and the columns are 

E ««<Z" l ► E an9 d " 



E««9m 



and so the diagram commutes when N = dM. This condition holds when 
d = p and M = N/p, reducing the Main Lemma to 

Theorem 5.7.3 (Main Lemma, second version). If f € S k (r 1 (N)) has 
Fourier expansion /(r) = E ct n(/) < ?]v w ith a n (f) = 0 whenever (n,N) = 1, 
then f = E P |AT f P with each f P e S k (r 1 (N/p)). 

The second step is to translate the Main Lemma's hypothesis into linear 
algebra. This is done by constructing a suitable projection operator. For any 
positive integer to recall the group 



r°(m) 



a b 

c d 



€ SL 2 (Z) : 



a b 
c d 



* 0 



(mod to) 



For any d \ N, let r d — A(-W) H r a (N/d), a congruence subgroup of level AT. 
Lemma 5.7.4. 4 se£ o/ representatives for the quotient space r(AT)\A; is 



1 bN/d 
0 1 



: 0 < b < d 



□ 



Proof. Exercise 5.7.2. 

Averaging over the coset representatives gives the trace operator, 

tt, : s k (r(N)) — ► s fe (r(A0), 7r d (/) = 1 • J2f{[l b T]^ 

6=0 

a projection to <Sfc(Ai)- (Recall that a projection is an idempotent linear map, 
i.e., 7r^ = ir d .) In terms of Fourier series, a geometric sum calculation (Exer- 
cise 5.7.3) shows that ttj, preserves the terms whose indices are multiples of d 
and kills everything else; that is, 

oo 

n=l n : d\n 
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It follows that Ttd 1 d 2 = 7T d 1 ^d 2 = 7T d 2 7r d 1 when d\d2 \ N. 
Now define tt : S k (r(N)) — > S k (r(N)) to be 

n = l-J2 n P+ ^PiP? = II( 1 ~ 7r p)' 

p\N Pl \N, p 2 \N p\N 

Pl<P2 

where the product denotes composition. By nature of the Wd and the additive 
expression for tt, the Inclusion-Exclusion Principle shows that tt preserves the 
part of / away from N and kills the rest, 

oo 

n=l n : (n,N) = l 

Thus the hypothesis of the Main Lemma is that / £ Sk(r 1 (N)) n ker(7r). 
Since the 7r p 's are commuting projections, facts from linear algebra about 
commuting projections (Exercise 5.7.4) show that 

ker(7r) = ker(^Q(l - tt p )) = J^ker(l - ir p ) = ^im(7r p ). 

p\N p\N p\N 

And recall that tt p is a projection of S k (r(N)) to S k (r p ), 

im(7r p ) = s k (r p ) = 5 fc (A(JV) n r°(N/ P )). 

Soker(Tr) = T, P \N S k( r i( N )^ r °( N /p))- Now it suffices to prove "c" (though 
we'll prove equality, no harder since the other inclusion is obvious) in 

Theorem 5.7.5 (Main Lemma, third version). 

s k (A (N) ) n s * (A ( N ) n r ° Wp) ) = Y s ^ rl Wp) ) ■ 

p\N p\N 

This version of the Main Lemma reduces to group theory. The group G = 
SL2(Z/7VZ) acts on the complex vector space S k (r(N)) from the right via 
the weight-fc operator. (The full modular group SL2(Z) acts and its normal 
subgroup r(N) acts trivially, so the quotient acts and the quotient is naturally 
isomorphic to G.) If the prime factorization of N is N — Y\7=iPi' then G is 
naturally identified with a direct product, 

n 

G = Y[G t , each G l = SL 2 (Z/p e i i Z). 

»=1 

For i = 1, . . . , n define subgroups of Gi, 

H i = r 1 ( P pyr(pt*), 

Jf i = (r 1 ( P r)nr 0 ( P r 1 ))/r( P r). 
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Lemma 5.7.6. For p prime and e > 1, 

(r V), n TV" 1 )) = ^ V" 1 )- 

Proof. To stay with the main argument, this will be done afterward. □ 

Let H = n"=i^i- Using superscripts to denote subspaces fixed by sub- 
groups of G, Lemma 5.7.6 shows that Theorem 5.7.5 rewrites as 

n n 

s k (r(N)) H n£s fc (r(JV))*« = Y, s k( r (*0) {H ' K,) i^.n) 

i=l i=l 

(Exercise 5.7.5). We now quote a result from representation theory: the vector 
space Sk(r(N)) is a direct sum of subspaces irreducible under the G-action. 
So (Exercise 5.7.6) condition (5.17) follows from a purely group-theoretical 
result, 

Proposition 5.7.7. Let V be an irreducible representation of the group G = 
riiLi Gi- Let H = nr=i ^ an< ^ ^ = 11™= l ^ e subgroups. Then 

v H nf2v Ki = it v{H ' Ki) - ( 5 - 18 ) 

»=1 i=l 

Quoting another result from representation theory, under the action of G = 
Iir=i ^ the irreducible vector space V decomposes as a corresponding tensor 
product V = 0™ =1 Ui with each G, acting on V*. Establishing Proposition 5.7.7 
is just a matter of decomposing each component Vj according to the various 
groups involved. Specifically (Exercise 5.7.7), each V$ has pairwise linearly 
disjoint subspaces V^i, V^2, ^3 such that 

vf Hi ' Ki} = V a , V* 1 * = Va®V i2 , V** = V a ®V i3 . (5.19) 

Now proving the proposition reduces to inspecting the various terms in (5.18). 
Conditions (5.19) show that 

v H = (V11 e V12) o • • • ® (Fii ev i2 )®-® (Ki e K 2 ), 

n n 

U U (5-20) 

n n 

J2 y {H ^ = ^(u n e v 12 ) ® • • • ® v a ® • • • ® (v nl e v n2 ). 

i=l i=l 

The last result we need to quote is that tensor products and direct sums 
of vector spaces satisfy the distributive law. Now (5.20) shows that the ten- 
sor products belonging to V H and contributing to V Ki are precisely the 
products contributing to ^ v^ H,Ki \ That is, multiplying the three right sides 
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out and converting the second and third ones to direct sums by eliminating 
redundant factors shows that the third is the intersection of the first two 
(Exercise 5.7.8). The proposition follows and the Main Lemma is proved. 
Finally, we need to go bade and establish Lemma 5.7.6, that 

(r 1 (p e ),r 1 (p e )nr°(^- 1 )) = r 1 (p e - 1 ). 

The containment "C" is clear, so only "D" is needed. Let r denote the group 
on the left side. Let m = [° J] € -T 1 ^ 6-1 ). It suffices to show that some 
clement of r-mr lies in r, so the proof will repeatedly replace m by 77717' 
with 7, 7' e r. 

First eliminate the cases p | a and p | d, which can occur only if e = 1 . If 
p I a then p \ 6 and so m [ \ ?] = satisfies p { a + b. Since [ J J] e r we 

may take p \ a. The argument for p \ d is similar. 

Next reduce to the case 6 = 0 (mod p e ) and c = 0 (modp e ). Let 
/3 = —bd^ 1 (mod/) (which makes sense now that p \ d), satisfying 6 + 
dp = 0 (mod p e ) and /3 = 0 (mod p 6 " 1 ). Then [Jf] e T and [Jf]m = 
[ a+ c c/3 b+ d d/3 ] , so we may take 6=0 (mod p e ). The argument for c = 0 (mod p e ) 
is similar and slightly easier, left multiplying m by suitable [~ ?] . 

So now 777 = [" J] with a = = 1 (mod p e_1 ), 6 = c = 0 (mod p e ). Since 
det 777 = 1, we have ad = 1 (mod p e ). Consider the matrix 





1 0 




1 1 -d 




1 0 






-1 1 




0 1 




a 1 





a + a(l — ad) 1 — ad 
ad — 1 g? 



Inspecting the multiplicands shows that 7 € f. Inspecting the product shows 
that 7 = m (mod/), so 7777^ = [J J] (mod p e ) and 7777^ e I 1 as well. 
Thus m e r and the proof is complete. 

Exercises 

5.7.1. Prove Lemma 5.7.2. 

5.7.2. Prove Lemma 5.7.4. (A hint for this exercise is at the end of the book.) 

00 

5.7.3. Show that Tr^^a^) = ^ a n q^. 

n=0 n:d\n 

5.7.4. State and prove the facts about commuting projections needed in the 
calculation ker(7r) = X^pijv ^ m { n p)- (A hint for this exercise is at the end of 
the book.) 

5.7.5. Use Lemma 5.7.6 to explain why Theorem 5.7.5 rephrases as condi- 
tion (5.17). (A hint for this exercise is at the end of the book.) 

5.7.6. Show that Proposition 5.7.7 implies condition (5.17). (A hint for this 
exercise is at the end of the book.) 
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5.7.7. In the proof of Proposition 5.7.7, explain why each Vi has pairwise 
linearly disjoint subspaces satisfying conditions (5.19). 

5.7.8. Letting n — 2, multiply out the right sides of (5.20), express the results 
as direct sums, and show that V H n £ V Ki = E V {H ^ . 

5.8 Eigenforms 

Recall from Corollary 5.6.3 that the spaces S k (r 1 (N)) old and 5 fc (ri(^)) now 
have orthogonal bases of eigenforms for the Hecke operators {T„, (n) : 
(n, N) = 1}. Let / be such an eigenform. Using the Main Lemma, this section 
will show that if / e 5fe(ri(AT)) ncw then in fact / is an eigenform for all T n 
and (n). If (n, N) > 1 then (n) = 0 (the zero operator) and so (n)f = 0 (the 
zero function) , meaning / is an eigenform for all (n) , with eigenvalues d n — 0 
when (n,N) > 1. Thus the only operators in question are the T n . 

Proposition 5.2.3 showed a similar result for Eisenstein series. The series 
'*(t) where i/"/ 3 = X modulo N is an eigenform for the Hecke operators 
(d) and T p where p \ N, and when t = 1 the scries is an eigenform for all T p . 
To allow the discussion here to cover Eisenstein series as well as cusp forms, 
definitions and results are given for all modular forms when appropriate. 

Definition 5.8.1. A nonzero modular form f G Mk(ri(N)) that is an 
eigenform for the Hecke operators T n and (n) for all n <E Z+ is a Hecke 
eigenform or simply an eigenform. The eigenform /(t) = J2^ =Q a n (f)q n 
is normalized when a\(f) = 1. A newform is a normalized eigenform 
m5 fe (A(7V)) new . 

The definition precludes the zero form consistently with the general defi- 
nition of eigenvector from linear algebra. In the parlance of this definition, we 
are about to show that 5fc(-Ti(-^)) new has an orthogonal basis of newforms. 

Let / e Sk(ri(N)) be an eigenform for the Hecke operators T n and (n) 
with (n, N) = 1 . Thus for all such n there exist eigenvalues c n ,d n € C such 
that T n f = c n f and (n)f = d n f. The map n h-> d n defines a Dirichlet char- 
acter x ■ (Z/iVZ)* — > C* (Exercise 5.8.1), and / e S k (N,x). Consequently 
formula (5.14) applies and says 

ai (T n /) = a n {f) forallneZ+. (5.21) 

Since / is an eigenform away from the level, also 

a\(T n f) = c n ai(f) when (n, N) = 1, 

so 

««(/) = c„ai(/) when (n, iV) = 1. 

Thus if ai(/) = 0 then a„(/) = 0 when (n, N) = 1 and so / e 5 fc (ri(A^)) old 
by the Main Lemma. 
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Now assume that / G 5 fc (A(iV)) ncw , / ^ 0. Then / £ 5 fc (ri(A^)) old 
and the preceding paragraph shows that a\(f) ^ 0, so we may assume / is 
normalized to ai(/) = 1. For any m G Z + let g m = T m f — a m {f)f, an element 
of Sk(ri(N)) ncw and an eigcnform for the Hecke operators T„ and (n) for 
(n,N) — 1 (Exercise 5.8.2). Compute that its first coefficient is 

ai{g m ) = ai(T m f) - ai(a m (/)/) 

= a m {f) - a m (f) by (5.21) and because ai(/) = 1 
-0, 

showing that g m G Sk(ri(N)) old by the argument of the preceding paragraph. 
So g m G 5 fc (r 1 (7V)) ncw n5 fe (A(iV)) old = {0}, i.e., T m f = a m (f)f. Since m is 
arbitrary this discussion proves most of 

Theorem 5.8.2. Let f G SkiT^N)) 11 ™ be a nonzero eigenform for the Hecke 
operators T n and (n) for all n with (n,N) = 1. TTien 

(a) f is a Hecke eigenform, i.e., an eigenform for T n and (n) for all n G Z + . 
^4 suitable scalar multiple of f is a newform. 

(b) // / satisfies the same conditions as f and has the same T n - eigenvalues, 
then f = cf for some constant c. 

The set of newforms in the space Sk(ri(N)) nev/ is an orthogonal basis of 
the space. Each such newform lies in an eigenspace Sk(N,x) and satisfies 
T n f = a n (f)f for all n G Z + . That is, its Fourier coefficients are its T n - 
eigenvalues. 

Proof. All that remains to prove is that the set of newforms in the space 
Sk(ri(N)) ncv/ is linearly independent. To see this, suppose there is a nontrivial 
linear relation 

n 

»=i 

with all Ci nonzero and with as few terms as possible, necessarily at least two. 
For any prime p, applying T p — a p (f\) to the relation gives 

n 

Y c i(«p(/i) - a p(h))h = 0- 

i=2 

This relation must be trivial since it has fewer terms, so a p (fi) = a p (fi) for 
all i. Since p is arbitrary this means that fi = f\ for all i, giving a contradiction 
since the original relation has at least two terms. □ 

The newforms in each diamond operator eigenspace Sk(N, x) ncw therefore 
are an orthogonal basis of the eigenspace. Part (b) in the theorem is the Mul- 
tiplicity One property of newforms, showing that the basis of 5fc(r 1 (A^)) ncw 



5.8 Eigenforms 197 



contains one element per eigenvalue where "eigenvalue" means a set of T n - 
eigenvalues {c„ : n e Z + }. That is, each cigenspace for the T n operators 
is 1-dimensional. Exercise 5.8.3 demonstrates a subspace of Sk{r\(N)) with- 
out a basis of eigenforms. If / e Sk{r\{N)) is an eigenform then / is old 
or new, never a hybrid / = g + h with g old and h new and both nonzero 
(Exercise 5.8.4). 

The functions fk(r) — rj{r) 'r](N V) from Proposition 3.2.2 are newforms, 
each spanning Sk(r 1 (N)) ncw and also spanning Sk(r 0 (N)) ncw when k is even. 
The reader is strongly urged to browse the online modular forms explorer at 
William Stein's web site [Ste] to see numerical examples of newforms. The ex- 
plorer also displays examples of many other constructs associated to modular 
forms, some of which will be explained in the following chapters here. Much 
of the information in the online explorer comes from John Cremona's tables 
[Cre97], also recommended to the reader who is interested in computational 
aspects of modular forms. 

Theorem 5.8.3. The set 

Bk{N) = {/(nr) : / is a newform of level M and nM \ N} 
is a basis ofSk(Ti(N)). 

Proof. (Partial.) Consider the decomposition 



The first term on the right side is spanned by the elements of Bk(N) with 
M = N and n = 1, and by induction on the level each summand in the 
second term is spanned by /(r) and f(pr) for newforms / of level dividing 
N/p. Thus B k {N) spans S^r^N)). 

To show that Bk(N) is linearly independent, suppose there is a nontrivial 
relation 



Here each fi lies in a space Sk(Mi, Xi) with Mi \ N and \i a Dirichlet charac- 
ter modulo Mi, and each n^- divides iV/Mj. Assume that each c^j in (5.22) 
is nonzero and that the relation has as few terms as possible. It must involve 
more than one fi since for a given i each jth function fiijii^r) — q ni - j + • • • 
starts at a different index. Each \i lifts to a Dirichlet character \i mod- 
ulo N, and then fi G Sk(N,Xi)- In fact all Xi must lift to the same char- 
acter modulo N, for if (say) Xi(c?) ^ XiiA) f° r some d € (Z/NZ)* then 
applying (d) — Xi(rf) to (5.22) yields a nontrivial relation with fewer terms 
(Exercise 5.8.5(a)). Similarly all fi have the same Fourier coefficients away 
from N, for if (say) a p (fi) ^ a P {f2) for some p\ N then applying T p — a p {f\) 
to (5.22) also yields a nontrivial relation with fewer terms (Exercise 5.8.5(b)). 



5 fc (A(J>0) = S fe (A(A0r w © ^((^(AWp))) 2 ). 



p\N 




(5.22) 
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Thus (5.22) involves distinct forms /j e Sk(Mi,Xi) with all Xi lifting to the 
same character modulo N and all a p (fi) equal for p \ N . By a result known 
as Strong Multiplicity One, these conditions make all of the fi equal, contra- 
dicting the possibility of linear dependence. □ 

Strong Multiplicity One also plays a role in the proof (Exercise 5.8.6) of 

Proposition 5.8.4. Let g € Sk(Ti(N)) be a normalized eigenform. Then 
there is a newform f £ <S fe (r , 1 (M)) new for some M \ N such that a p (f) = a p (g) 
forallp\N. 

The level M of the newform / in the proposition is called the conductor 
of f . The result is analogous to the fact that any Dirichlet character \ mod- 
ulo N has a corresponding primitive character Xprim modulo the conductor M 
of x such that x prim (p) = x(p) for all p\ N. 

We do not prove Strong Multiplicity One. Sec for example [Miy89], where 
the argument cites a result beyond the scope of that book as well. The argu- 
ment here that Bk(N) spans Sk(r\(N)) is complete, and this suffices for most 
purposes. 

The last result of this section holds for all modular forms. 

Proposition 5.8.5. Let f e M. k (N, x)- Then f is a normalized eigenform if 
and only if its Fourier coefficients satisfy the conditions 

(iW/) = i, 

(2) a p r(f) = a p (f)a p r-i(f) - x(p)p k 1 a p —^{f) for all p prime and r>2, 

(3) a mn (f) = a m (f)a n (f) when (m,n) = 1. 

The proposition does not say that any function /(r) = ^ n a n {f)q n with 
coefficients satisfying conditions (1), (2), and (3) is a normalized eigenform. 
The function need not be a modular form at all. 

Proof. The forward implication ( =>■ ) follows from the definition of T n in 
Section 3 (Exercise 5.8.7). For the reverse implication ( ), suppose / 
satisfies the three conditions. Then / is normalized, and to be an eigenform 
for all the Hecke operators it need only satisfy a m {T p f) = a p {f)a m {f) for all 
p prime and m e Z+. If p \ m then formula (5.14) gives a m (T p f) — a pm (f) 
and by the third condition this is a p (f)a m (f) as desired. On the other hand, 
if p | m write m = p r m' with r > 1 and p \ m' . This time 

a m {T p f) = <V+i ro ,(/) + x{p)p k ~ 1 a p r-i m ,(f) by formula (5.14) 

= (V +1 (/) + x(.P)P k ~ la pr-i(.f))am>(.f) by the third condition 
= a p (f)a p r(f)a m i(f) by the second condition 

= a p (f)a m (f) by the third condition. 



□ 
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In particular, Proposition 5.2.3 shows that this result applies to certain 
Eisenstein series Ef' v /2. That is, the Fourier coefficients a n (f) — ai'^n) 
satisfy the three relations of Proposition 5.8.5. Indeed, the proof of Proposi- 
tion 5.2.3 essentially was to verify this. 

Exercises 

5.8.1. Show that if / is an eigenform for the operators (n) with eigenvalues d n , 
then the map n4 d n defines a Dirichlet character \ '■ (Z/NZ)* — ► C*. 

5.8.2. Show that the function g m in the section is an cigcnfunction for the 
Hecke operators T n and in) when (n,N) = 1. 

5.8.3. Let / e <S 2 (rb(H)) be the unique normalized eigenform, cf. Exer- 
cise 3.5.4. 

(a) Compute that a 2 (/) = —2. 

(b) Define A(t) = /(2t), / 2 (t) = /(4r), and / 3 (r) - /(8r). Show that 
the set [3 = {/, /i,/2,/s} is linearly independent in <S 2 (_r 0 (88)). (A hint for 
this exercise is at the end of the book.) 

(c) Let V = span(/3). Show that V is stable under the Hecke operators, 
computing the matrix [T p ]p describing the action of T p on V with respect to (3 
for each prime p. 

(d) Show that there are exactly three normalized eigenforms in V. Thus 
this problem exhibits a space without a basis of eigenforms. 

5.8.4. Show that if / <E Sk(ri(N)) is an eigenform then / is old or new. (A 
hint for this exercise is at the end of the book.) 

5.8.5. (a) Show that if Xi{d) ^ X2{d) for some d G (Z/NZ)* then applying 
(d) — Xi(d) to (5.22) yields a nontrivial relation with fewer terms. 

(b) Show that if a p (f\) ^ a p (f 2 ) for some p \ N then applying T p — a p (fi) 
to (5.22) yields a nontrivial relation with fewer terms. Why does this require 
p \ Nl 

5.8.6. Prove Proposition 5.8.4 as follows. 

(a) Supply details to this argument: Suppose that for each newform /, of 
level dividing TV there exists a prime Pi\ N such that a Pi (fi) ^ a Pi (g). Write 
g in terms of the spanning set provided by Theorem 5.8.3 and then apply the 
operator HiC^Pi — a p.(/i)) to obtain a contradiction. 

(b) According to Strong Multiplicity One, the newform / for which the 
equality in the proposition holds is unique. Deduce that g is in the span of 
{/ (nr) : nM \ N} where M is the level of /. 

5.8.7. Prove ( ) in Proposition 5.8.5. 
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5.9 The connection with X-functions 

Each modular form / e A4k(ri(N)) has an associated Dirichlet series, its L- 
function. Let /(r) = Y^=o a n<l n , let s € C be a complex variable, and write 
formally 

oo 

L(sJ) = a n n~ s . 

n=l 

Convergence of L(s, f) in a half plane of s-values follows from estimating the 
Fourier coefficients of /. 

Proposition 5.9.1. If f E A4k(r\(N)) is a cusp form then L(s, f) converges 
absolutely for all s with Re(s) > k/2 + 1. If f is not a cusp form then L(s, f) 
converges absolutely for all s with Re(s) > k. 

Proof. First assume / is a cusp form. Let g(q) — J2^=i a nQ n , a holomorphic 
function on the unit disk {q : \q\ < 1}. Then by Cauchy's formula, 

a n = [ g{q)q~ n dq/q for any r G (0, 1) 
J\ q \ =r 

= f f(x + iy)e- 2mn( - x+lv Ux for any y > 0, where q = e 27vi( - x+iy) 

= e 27T [ f(x + i/n)e- 2mnx dx letting y = 1/n. 

Jx=0 

Since / is a cusp form, Im(T) fe / 2 |/(r)| is bounded on the upper half plane T-L 
(Exercise 5.9.1(a)), and so estimating this last integral shows that \a n \ < 
Cn k / 2 . The result for acusp form / now follows since |a„n _;s | = 0(n k ^ 2 ^ Rc ^). 

If E is an Eisenstein series in Ai/ c (r 1 (N)) then by direct inspection 
its Fourier coefficients satisfy \a n \ < Cn k ^ 1 (Exercise 5.9.1(b)) and now 
|an« s | = 0(n k ~ 1 ~ Rc ( s * > ). Since any modular form is the sum of a cusp form 
and an Eisenstein series the rest of the proposition follows. □ 

The estimate \a n (f)\ < Cn k / 2 for / e S^r^N)) readily extends to 
Sk(r(N)) and therefore to Sk(r) for any congruence subgroup r of SL 2 (Z). 
Similarly for the estimate |a n (.E)| < Cn k ^ 1 for Eisenstein series E € 
Mk(r(N)). The upshot is that every modular form with respect to a congru- 
ence subgroup satisfies condition (3') in Proposition 1.2.4, 

(3') In the Fourier expansion /(r) = Y^=o a n < lNi the coefficients satisfy the 
condition 

\a n \ < Cn r for some positive constants C and r. 

So finally the converse to that proposition holds as well: if / is holomorphic 
and weight- k invariant under _T then / is a modular form if and only if it 
satisfies condition (3'). 

The condition of / being a normalized cigenform is equivalent to its L- 
function series having an Euler product. 
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Theorem 5.9.2. Let f e Mk{N,x), f(r) — Y^=Q a nQ_ n - ^ e following are 
equivalent: 

• f is a normalized eigenform. 

• L(s, /) has an Euler product expansion 

Hs, /) = H(l - a pP - s + X {p)p k - 1 - 2s )-\ 
v 

where the product is taken over all primes. 

Note that the Euler product here is the Hecke operator generating function 
product (5.12). 

Proof. By Proposition 5.8.5, the first item here is equivalent to three condi- 
tions on the coefficients a n , so it suffices to show that those conditions are 
equivalent to the second item here. 

Fix a prime p. Multiplying condition (2) in Proposition 5.8.5 by p~ rs and 
summing over r > 2 shows, after a little algebra, that it is equivalent to 

oo 

53 Oprp"" • (1 - a pP ~ s + X (p)p k - 1 - 2s ) = oi + (1 - ai )p- s . (5.23) 

r=0 

If also condition (1) in Proposition 5.8.5 holds then this becomes 

oo 

53 OprjT" • (1 - a p p- s + X {p)p k - X - 2s ) = I- (5-24) 

r=0 

Conversely, suppose (5.24) holds. Letting s — > +oo shows a\ = 1 so condi- 
tion (1) in Proposition 5.8.5 holds, and so does (5.23), implying condition (2) 
in Proposition 5.8.5. So conditions (1) and (2) in Proposition 5.8.5 are equiv- 
alent to 

oo 

53 a r p- rs = (1 - a pP - s + x(p)/" 1 " 25 )" 1 for p prime. (5.25) 

Before continuing, note that the Fundamental Theorem of Arithmetic (pos- 
itive integers factor uniquely into prime powers) implies that for a function g 
of prime powers (Exercise 5.9.2), 

oo oo 

nE^ r )=En^ r )- (5.26) 

p r—0 n— l p r \\n 

The notation p r \\n means that p r is the highest power of p that divides n, and 
we are assuming that g is small enough to justify formal rearrangements. 

Now, if (5.25) holds along with condition (3) of Proposition 5.8.5 then 
compute 
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L(s, /) = ^ a n n s = ^ I TJ a p r- \n s by the third condition 



n— 1 n— 1 



= E II v-f>~ rs = II E v^~ rs b y ( 5 - 26 ) 

n=l p^Hn P r=0 

= - a pP~ S + X{p)p 1 - k - 2s r 1 by (5.25), 
p 

giving the Euler product expansion. 

Conversely, given the Euler product expansion, compute (using the geo- 
metric series formula and (5.26)) 

L(s, f) = - a p p- s + X (p)p 1 - k - 2s r 1 

P 

CO 

some {b P:r } 

p r— 0 

oo oo / 

= EIlw- rs = E \U b 



n=ln r \\n n=l 



p,r 

p r \\n 



So a n = Y[ p r\\ n bp,r> giving condition (3) of Proposition 5.8.5 and showing in 
particular that b p . r = a p r. This in turn implies (5.25), implying conditions (1) 
and (2) of Proposition 5.8.5. □ 

As an example, the L-function of the Eisenstein series E^' v /2 works out 
to (Exercise 5.9.3) 

L{s, = L{s, 4,)L{s — k+l,tp) (5.27) 

where the L-functions on the right side are as defined in Chapter 4. For another 
example see Exercise 5.9.4. 

Let N be a positive integer and let A be the ring Z[/X3]. For any char- 
acter x ■ (A/NA)* — > C* , Section 4.11 constructed a modular form 
6 X e Mi(3N 2 ,ip) where ip(d) = x(d)(d/3). Recall that x needs to be triv- 
ial on A* for 0 X to be nonzero, so assume this. The arithmetic of A and 
Theorem 5.9.2 show that 6 X is a normalized eigenform. The relevant facts 
about A were invoked in the proof of Corollary 3.7.2 and in Section 4.11. To 
reiterate, A is a principal ideal domain. For each prime p = 1 (mod 3) there 
exists an element ir p e A such that 7r p 7f p = p, but there is no such element if 
p = 2 (mod 3). The maximal ideals of A are 

• for each prime p = 1 (mod 3), the two ideals (n p ) and (7f p ), 

• for each prime p = 2 (mod 3), the ideal (p), 

• for p = 3, the ideal (\/— 3). 
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Let tt p = p for each prime p = 2 (mod 3), let 7r 3 = and take the set of 

generators of the maximal ideals, 

S = {7Tp,7f p : p = 1 (mod 3)} U {n p : p = 2 (mod 3)} U {7r 3 }. 

Then each nonzero n e A can be written uniquely as 

n = u TT 7r a " , u e A*, each a w € N, a w = 0 for all but finitely many 7t. 

ttGS 

Correspondingly x(n) = fl^es xW 1 " ■ The Fourier coefficients of 6 X were 
given in (4.50), 

a m (0 x ) = \ *( n )- 

|n| 2 — m 

Compute that therefore 



L(8,0 X ) = | X! X(n)\n\~ 2s =U(l xWkr 28 )" 1 = [J^A), 
where (Exercise 5.9.5) 



ra6A 7rG5 
n#0 



'1-(XK) + X(^)K S + X(PK 2S ifp^l (mod 3), 
L p (s,e x )-'= {1- X {P)P~ 2S h> = 2(mod3), 

1-X(\/^3)3 _S ifp = 3. 

(5.28) 

Since L p (s,6 x ) = (1 - a p (9 x )p~ s + ^(p)? -25 ) -1 m au cases, Theorem 5.9.2 
shows that 6 X is a normalized eigcnform. 



Exercises 

5.9.1. (a) For any cusp form / £ 5fc(i"i(Ar)) show that the function if(r) = 
Im(r) fc / 2 |/(r)| is bounded on the upper half plane %. (A hint for this exercise 
is at the end of the book.) 

(b) Establish the relation 1 < (7j._ 1 (n)/n fe ~ 1 < C(k — 1) where £ is the 
Ricmann zeta function. Show that the Fourier coefficients a n of any Eisenstein 
series satisfy \a n \ < Cn k ^ 1 . 

5.9.2. Prove formula (5.26). (A hint for this exercise is at the end of the book.) 

5.9.3. Prove formula (5.27). What is a half plane of convergence? 

5.9.4. Recall the functions /, fx, fi, and / 3 from Exercise 5.8.3. The exercise 
showed that the 4-dimensional space spanned by these functions contains only 
three normalized eigenforms. How do the L- functions of the three eigcnforms 
relate to L(s, /)? 



5.9.5. Establish formula (5.28). 



204 5 Hecke Operators 

5.10 Functional equations 

Let /(t) = Yl^Li a nq n € Sk(ri(N)) be a cusp form of weight fc. Its associated 
L-function is L(s,f) — Y^=i a n nS > convergent for Re(s) > fc/2 + 1. As in 
Chapter 4, the Mellin transform of / is 

f°° dt 
g(s)= / /(**)*• — 
Jt=o 1 

for values of s such that the integral converges absolutely. 

Proposition 5.10.1. The Mellin transform of f is 

g(s) = (2ir)- s r(s)L(s, /), Re(s) > fe/2 + 1. 

This is shown exactly as in Chapter 4 (Exercise 5.10.1). This section shows 
that the function 

A N (s) = N s / 2 g(s) 
satisfies a functional equation. Define an operator 

W N : 5 fc (A(J\0) — ► 5 fc (ri(JV)), / ^ i fc JV 1_fc/2 /[[^ "5]]fc- 

That is, (Wjv/)(t) = i k N- k / 2 T~ k f{-l/(Nr)). This is essentially the w N op- 
erator from the end of Section 5.5. The operator Wm is idempotcnt, mean- 
ing W N is the identity operator (Exercise 5.10.2), and similarly to Exer- 
cise 5.5.1(c) Wm is self-adjoint, 

{W N h , / 2 > = (fi , , /i , h G 5 fc (A (iV) ) . 

Letting <Sfc(A(A0) + and <Sfc(A(A0)~ denote the eigenspaces 

5 fe (AW) ± - {/ e 5 fc (A(JV)) : WV/ = ±/} 

gives an orthogonal decomposition of <Sfc(A(-W)) under the Hecke algebra, 

s k (r 1 (N)) = s k (r 1 (N))+®s k (r 1 (N))-. 

Theorem 5.10.2. Suppose f G <S/ S (A(^V)) ± - Then the Mellin transform 
A N (s) extends to an entire function satisfying the functional equation 

A N {s) = ±A N {k-s). 

Consequently, L(s, f) has an analytic continuation to the full s-plane. 

Proof. Take / e 5 fe (A(iV)) ± and compute 

A N (s) = iV*/ 2 r /(it)ff = r f{itl>/NW^. 

Jt=o 1 Jt=o 1 
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Since f(it/V~N) is of order e 27r */v n ag ^ ^ oo, the piece of this integral from 
t = 1 to oo converges to an entire function of s. For the other piece compute 
that (W N f)(i/(y/Nt)) = t k f(it/^), so that 

dt 

T 

/ (w N mt/VN)t k - s T 
jt=i t 



r f{it/VN)t sdi - = r (w N m/(VNt))t s ~ k 

Jt=o 1 Jt=o 

(W N f)(it/VN)t k ~ s 



Since W^f = ±/ this also converges to an entire function of s. Thus A N (s) has 
an analytic continuation to the full s-plane. To obtain the functional equation 
note that in total the integral is 

A N (s) = jH (f(it/VN)t s + (W N f)(U/VN)t k ~ s ) j. 

Since WnJ — ±/ this is ±/ljy(fc — s), completing the proof of the functional 
equation. □ 

It is natural to wonder about a converse to Theorem 5.10.2, a result that 
Dirichlct series with analytic continuations and functional equations come 
from modular forms. Such a theorem exists due to Hecke and Weil, though it 
is not quite as straightforward as one might guess. For a summary discussion 
see [Kob93], and for the full story see [Ogg69], [Bum97], or [Miy89]. 

Exercises 

5.10.1. Prove Proposition 5.10.1. 

5.10.2. Show that Wn is idempotent. 



5.11 Eisenstein series again 

Recall from Section 4 that the integral defining the Petersson inner product 
converges when at least one of / and g is a cusp form, not necessarily both. 
In particular it is meaningful to take the inner product of an Eisenstein series 
and a cusp form. This section will show that such an inner product always 
vanishes, meaning that the Eisenstein series are orthogonal to the cusp forms. 
(As mentioned before, this usage of "orthogonal" is a slight abuse of language.) 
Thus the Eisenstein spaces can be redefined as complements rather than as 
quotient spaces. 

Specifically, recall the Eisenstein series with parameter associated to the 
group r(N) in Section 10 of Chapter 4, 



E v k ( T ,s) = e N Mr) s [l}k, Re(fc + 2 S )>2. 

~te(P+nr(N))\r(N)8 
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Here e N is 1/2 if N £ {1, 2} and 1 if N > 2, v = (c v , d v ) is a vector in (Z/NZ) 2 , 
S = [c d ] 1S a ma trix in SL 2 (Z) whose bottom row (c v ,d v ) is a lift of v 
to Z 2 , and P + = {[g"]:neZ}is the positive part of the parabolic subgroup 
of SL 2 (Z). The weight-fc operator in this context is 

(fb}k)(r,s) = i(7,r)- fe /(7(r),s) for 7 e SL 2 (Z). 

Analytically continuing E% (r, s) in the s-plane to s = 0 recovered the Eisen- 
stein series E%(t) from earlier in Chapter 4. At weight k = 2 the continued 
series are nonholomorphic at s — 0 but linear combinations cancel away the 
nonholomorphic terms. Bearing all of this in mind, redefine 

£ k (r(N)) = the holomorphic functions in span({£f(r, 0) : v G (Z/NZ) 2 }). 

So now £k(r(N)) is a subspace of Aik(r(N)) linearly disjoint from the 
cusp forms Sk(r(N)), replacing its earlier definition as the quotient space 
M k (r(N))/S k (r(N)). That is, 

M k (r(N)) = s k (r(N)) e £ fc (r(jv)). 

To show that this decomposition is orthogonal introduce the notation 



p+(n) = P+n r(N) 



1 nN 
0 1 



: n £ Z 



so a fundamental domain for the orbit space P+(N)\H* is the set 
V* N = {t £ H* n C : 0 < Re(r) < A^} U {oo}. 

Let 

P+(7V)\r(A^) = |J P+{N) ai and r(7V)\SL 2 (Z) = (J T(iV)^ 
so that 

P + (iV)\SL 2 (Z) = yP + (7V)a i /? j , and ^ = [J Qi A,(D*). 

(For instance, the decomposition T>\ = Ui a j(^'*) i s partially illustrated in 
Figure 2.4.) For any cusp form / £ Sk(r(N)) the relation ao(/) — 0 is 

/ f(x + iy)dx = 0 for any y > 0, (5.29) 

similar to Exercise 1.1.6 (Exercise 5.11.1(a)). Consequently for any s £ C with 
Re(fc + 2s) > 0 we can evaluate the absolutely convergent double integral 



DC 



N 



f(x + iy)y k+s - 2 dxdy = 0, (5.30) 

=0 Jx=0 
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or in terms of the complex variable t, recalling the measure dfi(r) = dxdy/y 2 , 

f f(T)(lm( T )) k+s d»(T) = 0 (5.31) 
(Exercise 5.11.1(b-c)). This relation rewrites as 

° = E / /(a^(r))(Im(a^(r))) fc + s ^(r) 

If Rc(fc + 2s) > 2 then the sum over i converges absolutely and passes through 
the integral, giving 

0 = E / m'(r))Ef^(0Ar),s)/e N ■ (Im(^(r))) fc ^(r). 

This sum of translated integrals is by definition the integral over the modular 
curve X(r(N)), cf. Section 4, and in fact it is a constant multiple of the 

Petersson inner product of the cusp form /(t) and E^'^{r, s). Thus, the 
inner product is 0 for all s in a right half plane, and the relation analytically 
continues to s = 0, 

a^r y >r W =o. 

Consequently for any 7 e SL 2 (Z), 

(f,Ef^) r{N) = (.f,Ef^[j] k ) r{N) = (fh-%,Ef^) r{N) 

by Proposition 5.5.2(a) generalized to the Petersson inner product of a cusp 
form and a general modular form. Since also /[7 _1 ]fc € Sk(r(N)) the product 
is 0. Thus the cusp forms and the Eisenstein series are orthogonal at level N 
as claimed. 

For any congruence subgroup r at level N redefine 

£ k {r) = £ k {r{N))nMk{r), 

and this applies in particular when _T = ri (TV). Redefine for any Dirichlct 
character \ modulo N 

£k(N,x)=£k(ri(N))nM k (N, X ). 

For any congruence subgroup r the Eisenstein space is linearly disjoint from 
the cusp forms and similarly for the cigenspaces, 

M k (r) = S k (r) ($ £ k (r) and M k (N, X ) =S k (N, X )®£k(N,x), 
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and the decompositions are orthogonal. The sets of Eisenstein series specified 
in Chapter 4 as coset representatives for bases of the Eisenstein spaces as 
quotients are now actual bases of the Eisenstein spaces as complements. 

Using Proposition 5.8.5, it is easy (except when k = 2 and x = 1) to 
prove that 5fc(/i(iV)) _L £k(N, x) with Hecke theory rather than the previous 
integral calculation for the larger space Mk(r(N)). Exercise 5.11.2 gives the 
argument. 



Exercises 



5.11.1. (a) Let the modular form / e Aik(r(N)) have Fourier expansion 
/(r) = J2n=o a n q% where q N = e 2mT ' N . Show that for all n > 0, 

1 r N 

a n = - f(x + iy)e- 2 * m{ - x+ ^t N dx for any y > 0, 

iV Jx=0 

so in particular (5.29) holds. 

(b) Assuming absolute convergence, show that the integral in (5.30) is 0. 

(c) For any cusp s — a(oo) with a e SL 2 (Z) show by changing variable 
that the integral in (5.31) over a neighborhood of s takes the form 

/ /( a (r'))j(«,r') fe Im(r') fc+s |j(«^')r 2(fe+s) ^(r')- 

J\m(T')>y Q 

Explain why this integrand decays exponentially as Im(r') — >• oo if Re(fc + 
2s) > 0. 

5.11.2. (a) Let / € Sk(ri(N)) be a normalized eigenform with eigenvalue a p 
under the Hecke operator T p for each prime p. Show that / has eigenvalue a p 
under the adjoint T*. (Hints for this exercise are at the end of the book.) 

(b) Let x be a Dirichlct character modulo N. Let ip and ip be primitive 
Dirichlet characters modulo u and v such that uv \ N and (ip(p)(— 1) = (— l) fe 
and ipip — x at level N, excluding the case k = 2, = = li- Recall that 
Ef ,ip /2 is a normalized eigenform. Let its T p -eigenvalues be denoted b p . Show 
that b p ^ a p for some prime p. 

(c) Show that Ei' v _L /. 



(d) Let t be a positive integer such that tuv \ N. Let a = 



t- L o 

0 1 



so that 



a' = (1/t) [ID in Proposition 5.5.2. We know that a^A^a = ^(N/t) n 
r°(t). Note that E^'^' 1 is a nonzero constant multiple of Ef' v [a']k since the 
exceptional case is excluded. For any / € Sk(ri(N)) show that 

{f,Et*[a']k) ri(N) = (f[a)k,Etnr l{ N/ty 



Explain why this is 0 and why consequently all of £k (N, x) is orthogonal to 
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5.11.3. Let A be the ring Z[/X3]. Recall the statement of Cubic Reciprocity in 
Section 4.11: Let d e Z + be cubefree and let N — 3Y\ p ^ d p. Then there exists 
a character 

X : (A/NA)* — ► {1,H3, fi%} 

such that the multiplicative extension of x to all of A is trivial on A* and on 
primes p \ N, while on elements ir of A such that ttt: is a prime p \ N it is 
trivial if and only if d is a cube modulo p. The character x is nontrivial if d > 1 . 
Define ip(n) = x(n)(n/3) (Legendre symbol) for n G (Z/37V 2 Z)* and let 6> x <= 
.Mi(3iV 2 , ■!/>) be the corresponding modular form constructed in Section 4.11, 
shown to be a normalized eigenform at the end of Section 5.9. This exercise 
shows that 0 X is a cusp form if d > 1. The proof is by contradiction. Thus 
take d > 1 and suppose that 9 X is not a cusp form. 

(a) For any positive integers t, u, v such that tuv \ 3N 2 and any Dirichlct 
characters <f> and ip modulo u and v with 0<p = -0 at level 3A^ 2 recall the 
Eiscnstein series #f'^* e £i(3iV 2 ,-0). According to Theorem 4.8.1 the set of 
such Eisenstein series is a basis. Use the idea of Exercise 5.8.6(a) to show that 
there exist such u, v, <f), and ip such that 

a p (Ep v /2) = a p (8 x ) for all p \ 3N 2 . 

(b) Show that <f>{n) = -<p{n) for all n G (Z/3N 2 Z)* such that n = 
2 (mod 3). Deduce that 

0(n) = p(n) g) , ne(Z/37V 2 Z)*. 

(c) Show that </) 2 is trivial and so a p (9 x ) <E {±2} for all p = 1 (mod 3) 
such that p \ N . Deduce that x( 7r ) € {±1} for all primes n of A not dividing 
iV (cf. Section 5.9), so x 1S trivial or quadratic. This contradiction shows that 
9 X is a cusp form. 
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Jacobians and Abelian Varieties 



Let X be a compact Ricmann surface of genus g > 1 and fix a point x a in X. 
Letting x vary over points of X and viewing path integration as a function of 
holomorphic differentials wonX, the map 



When g = 1 this is an isomorphism of Abelian groups. When g > 1 the 
domain X is no longer a group, but the codomain still is. The codomain is the 
Jacobian of X, complex analytically a g-dimcnsional torus C 9 / A g where A g = 
X 2g . This chapter presents the Jacobian and states a version of the Modularity 
Theorem mapping the Jacobian of a modular curve holomorphically to a given 
elliptic curve. The map is also a homomorphism, incorporating group structure 
into the Modularity Theorem whereas the first version, back in Chapter 2, was 
solely complex analytic. This chapter then uses the Jacobian to prove number- 
theoretic results about weight 2 eigenforms of the Hecke operators. It ends 
with another version of Modularity replacing the Jacobian with an Abelian 
variety, a quotient of the Jacobian. The Abelian variety comes from a weight 2 
cigenform, so this version of the Modularity Theorem associates an eigenform 
to an elliptic curve. 

Related reading: The main results of this chapter are from Chapters 3, 7, 
and 8 of [Shi73]. For the basics on Jacobians of Riemann surfaces, see [FK80]. 
For material on complex tori and complex Abelian varieties, see [Swi74]. There 
arc many introductory texts to algebraic number theory, such as [Mar89] 
and [Sam72]. 




is an injection 




linear functions of holomorphic differentials on X 
modulo integration over loops in X 
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6.1 Integration, homology, the Jacobian, and Modularity 

We begin by describing the situation sketched in the chapter introduction 
when g = 1. Let X be a complex elliptic curve C/A. For any point z e C 

z-\-\ 

consider the family of integrals in the complex plane {J Q dQ : A G A} = 
z + A. Each such integral can be viewed as the integral over any path from 0 
to z + A, and two such integrals differ by the integral over any path between 
lattice points Ai and A2 in C. Since d£ is translation-invariant it makes sense 
as a holomorphic differential on X and so path integrals in C project to path 
integrals in X. Two projected path integrals in X from 0 + A to the same 
cndpoint z + A can take different values, coming from plane integrals whose 
difference projects to an integral f d( where a is a loop in X. Thus, viewing 
the cosets z + A in two ways gives a bijection from the elliptic curve to a 
quotient of its path integrals, 

X I path integrals J ^>}/ { integrals J d( over loops a j. 

The bijection is a group isomorphism since computing modulo integrals over 
loops and using the translation-invariance of d(, 

rZi+A rz 2 +A rz 1 +A /•z 1 +z 2 +A pz%+z 2 +A 

/ d(+ / d(= / dC+ / d(= / d(. 

JO+A JO+A JO+A Jzi+A JO+A 

To generalize these ideas to higher genus g > 1 , let X now be any compact 
Riemann surface. The first thing to show is that path integrals on X, similar 
to the path integrals ubiquitous in complex analysis, are well defined. That is, 
if 7 : [0, 1] — > AT is a continuous function and w is a holomorphic differential 
on X then there is a meaningful notion of the integral J oj. 

To see this, begin by supposing that the image of 7 lies in one neighbor- 
hood U with local coordinate ip : U — > V C C and with uj\v = f(q)dq. In 
this case the integral is evaluated in local coordinates, 



1^=1 u\v= f(q)dq. 



(See Exercise 6.1.1 for the definition of the last integral here.) This definition 
seems ambiguous if the image of 7 lies in the intersection U\ n U2 of two 
coordinate neighborhoods, but both coordinates give the integral the same 
value. Indeed, let V\_2 = <Pi(U\ H U2), let V2,i = ^2(^1 H U2), and let <p2,i ■ 
V\_2 — > V2.1 be the transition function between the two coordinate systems. 
By definition of the transition function, by the change of variable formula, 
and by the compatibility condition ^>2i{ UJ \v 2 ) = w |v"n 

/ w|y 2 = / U\v 2 = / ¥>2,lMv 2 ) = / 



6.1 Integration, homology, the Jacobian, and Modularity 213 



So the local integral is the same on overlapping coordinate systems. 

In general, the domain [0, 1] of 7 partitions into finitely many intervals 
each having image in one coordinate neighborhood of X (Exercise 6.1.2(a)). 
The integral over 7 is defined as the corresponding sum of local integrals, and 
it is easy to check that the result is well defined (Exercise 6.1.2(b)). 

With path integration understood we now want to define as path-indepen- 
dent an integral as possible. For arbitrary points x, x' e X, the notion of 
integrating holomorphic differentials from x to x' , 



f u, cue tfjU*), 

J X 



is not well defined since different paths from x to x' might give different values 
for the integral. But if 7 and 7 are paths from x to x' then letting a be the loop 
obtained by traversing forward along 7 and then back along 7, the integrals 
along the two paths differ by the integral around the loop, 



That is, for path-independent integration of holomorphic differentials from x 
to x' we need to quotient away integration over loops. 

Viewing X as a sphere with g handles where g is the genus of X, let 
A\, . . . , A g be longitudinal loops around each handle like arm-bands, and let 
Bi , . . . , B g be latitudinal loops around each handle like equators. We state 
without proof that for any nonnegative integer N, any integers l\, . . . , In, and 
any loops cti, . . . , oln there are unique integers mi, ... , m g , m,...,n g such 
that 

N „ g ,. 9 f 

»=i i=i Ja * »=i Jb * 

(See a Riemann surface theory book such as [FK80] for the unproved assertions 
in this section.) Thus the group of integer sums of integrations over loops is 
the free Abelian group generated by integration over the Ai and the Bi , called 
the (first) homology group of X, 

n 1 {x,z) = z [ e-ez/ ®z / e-ez / =z 29 . 

J A X JAg J Bl JBg 

(Since we deal only with this homology group we won't bother saying "first" 
any more.) Elements of Hi(X, Z) are maps taking the holomorphic differen- 
tials on X to complex numbers, and in fact the homology group is a subgroup 
of the dual space J7j^ ol (X) A = Homc(^hoi(^)> C), the vector space of C-linear 
maps from f2^ ol (X) to C. We also state without proof that 

r[ 8-"8r/ eR / ©-©r / =Ql ol (X) A . 

J A! JAg J Bl JBg 
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Recall from Section 3.4 that dimc(/?hoi(^)) = 9- Tims Hi(X, Z) is a lattice in 
the dual space. The quotient, complex analytically a g-dimensional complex 
torus C 9 /yl g , is the Jacobian. 

Definition 6.1.1. The Jacobian of X is the quotient group 

Jac(X) = ^ ol (X) A /H 1 (X,Z). 

Abel's Theorem relates the Jacobian of X to the divisors on X, as dis- 
cussed in Chapter 3. Let C(X) be the function field of the compact Riemann 
surface X. The degree-0 divisor group of X is 

Div°(X) = £ Ti x X . Jl x £ Z, 72^ = 0 for almost all x, Ti x = 0 > , 
[xex x J 

and the subgroup of principal divisors is all the divisors linearly equivalent 
to 0, 

Div £ (X) = {Se Div°(X) : 5 = div(f) for some / e C(X)}. 

The quotient group of degree-0 divisors modulo principal divisors on X is the 
(degree-0) divisor class group of X or the (degree-0) Picard group of X, 

Pic°(X) = Div°pO/Div £ (X). 

The full Picard group is divisors of all degrees modulo principal divisors, but 
we use only Pic°(X) and so we simply call it the Picard group from now on. 
The Picard group measures the extent to which degree-0 divisors fail to be 
the divisors of meromorphic functions on X. If X has genus g > 0 and x 0 is 
a base point in X then X embeds in its Picard group under the map 

X — >Pic°(X), x^[x-x 0 ], 

where [a; — xo] denotes the equivalence class x — x 0 + Div £ (X), the subtrac- 
tion occurring in Div°(X). (In general there is no subtraction on X.) The 
map is an embedding because there are no functions / € C(X) of degree 1 
(Exercise 6.1.3). The map from degree-0 divisors to the Jacobian 

Div°pO — > Jac(X), n * x ^ n * I t 6 - 1 ) 

x x J*" 

is well defined. 

Theorem 6.1.2 (Abel's Theorem). The map (6.1) descends to divisor 
classes, inducing an isomorphism 

Pic°(X) -A- Jac(X), [j^n^x ^^n x [ . 
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Thus the degree-0 divisors describing mcromorphic functions on X are 
those that map to trivial integration on f2^ ol (X) modulo integration over 
loops. If X has genus g > 0 then its embedding into its Picard group followed 
by the isomorphism of Abel's Theorem shows that the map 

X — > Jac(X), x i y I 

embeds the Riemann surface in its Jacobian. When X is an elliptic curve this 
map is the isomorphism from the beginning of the chapter. Abel's Theorem 
also shows that the finite integer sums of path integrals in X — in fact even just 
the sums whose coefficients sum to 0 (add a suitable multiple of f*°) — make 
up the entire dual space of i7^ ol (A), 

^ 0 iW A = |E n 7^E n 7 = o|- (6.2) 

We will use this freely from now on. 

For example, when X — C the Jacobian is trivial because g = 0, and Abel's 
Theorem states that every degree-0 divisor on the Riemann sphere is principal. 
This is familiar from complex analysis: if the divisor is 5 = z i~ X)-P«+ n coOo 
where the sums are over the desired zeros and poles in C, allowing repetition, 
then the suitable rational function Yi( z ~ Z i)/Y[( z ~Pi) nas this divisor since 
its order at oo is minus the sum of its orders in C. 

For another example, when X = C/A the Jacobian Jac(A) identifies 
with X as explained at the beginning of the section, and the Abel map takes 
divisors ^2n x x to the corresponding sums in X. Abel's Theorem states in 
this case that degree-0 divisors on X such that ^n x x <G A are the divisors 
of elliptic functions with respect to A. This is the constraint obtained in Ex- 
ercise 1.4.1(c) and used in Section 1.4 to describe the group law on elliptic 
curves in C 2 . 

The Modularity Theorem can be stated using the Jacobian. Recall that 
X 0 (N) is the compact modular curve associated to the congruence sub- 
group r a (N). Let Jq(N) denote its Jacobian, 

J 0 {N) = Jac(A o (A0). 

Theorem 6.1.3 (Modularity Theorem, Version Jc). Let E be a complex 
elliptic curve with j(E) <G Q. Then for some positive integer N there exists a 
surjective holomorphic homomorphism of complex tori 

Jo(A0 — ► E. 

The word "homomorphic" here introduces algebraic structure into the 
Modularity Theorem but it comes essentially for free. View Jo(Af) as a com- 
plex torus C 9 1 A g . Proposition 1.3.2 extends to higher dimension: 
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Proposition 6.1.4. Let <p : C 9 /A g — > C h /Ah be a holomorphic map of 
complex tori. Then 

(p(z + A g ) = Mz + b + A h 

where M <E M/^ S (C) is an h-by-g matrix with complex entries such that 
MA g C A h , and beC h . 

This is proved the same way as in Chapter 1 . By topology the map ip lifts 
to a map dp : C 9 — > C h of universal covers, and it follows that dp' is bounded. 
Applying Liouville's Theorem in each direction makes dp' constant, and now 
everything proceeds as before. In particular, if the Modularity Theorem only 
supplied a holomorphic map then this would take the form z+A g n- m-z+b+A 
where now m £ C 9 is a vector such that m ■ A g c A. Translating by —b gives 
a homomorphism as claimed. 

Version Jc of the Modularity Theorem implies Version Xc (from Sec- 
tion 2.5), that every complex elliptic curve E with j(E) e Q is the holomor- 
phic image of a modular curve X 0 (N). Indeed, if E is the image of io(N) 
then Jq(N) is nontrivial, so Xo(N) itself has genus g > 0 and embeds in its 
Jacobian as discussed. The embedding X 0 (N) — > Jo (AT) composes with the 
map Jq(N) — > E of Version Jc to show Version Xc (Exercise 6.1.4 is to 
show that the composition surjects). The converse, that Version Xc of the 
Modularity Theorem implies Version J c , will be shown at the end of the next 
section. 

Exercises 

6.1.1. This exercise defines the integral of a holomorphic differential over 
a continuous path in C, using integration over polygonal paths (which is 
easy) and Cauchy's Theorem for such integration. Let V be an open subset 
of C, let 7 : [0, 1] — > V be a continuous function, and let ui = f(q)dq be a 
holomorphic differential on V. For each t <E [0,1], the image point 7(f) lies 
in some disk D t C V. Show that {-f^ 1 (D t ) : t € [0,1]} is a collection of 
open subsets of [0,1] such that [0,1] = {J t 1~ 1 (D t ). (Intervals [0,6), (a, 1], 
and [0, 1] are open as subsets of [0, 1].) Every open subset of [0, 1] is a union of 
open intervals by definition. Show that since [0, 1] is compact, it is covered by 
finitely many open intervals each taken by 7 into a disk D of V, and therefore 
it partitions into finitely many closed intervals with the same property. Define 
/ w to be the integral of uj along the polygonal path connecting the images 
o? the partition points. Show that this is well defined by using a common 
refinement and Cauchy's Theorem. 

6.1.2. Let X be a compact Riemann surface and let 7 : [0,1] — ► X be 
continuous. 

(a) If {Uj : j € J} is a collection of coordinate neighborhoods in X such 
that X = [J j Uj, show that {j~ 1 (Uj) : j € J} is a collection of open subsets 
of [0, 1] such that [0, 1] = |Jj l~ 1 {Uj)- Show that [0, 1] is covered by finitely 
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many open intervals each taken by 7 into one coordinate neighborhood U 
of X, and therefore it partitions into finitely many closed intervals with the 
same property. Thus, integrating along the path 7 can be defined as a finite 
sum of local integrals. 

(b) Consider two partitions of the domain [0, 1] as in part (a). Show that 
the two corresponding sums of local integrals are equal. 

6.1.3. Use the Riemann-Hurwitz formula from Section 3.1 to show that if X 
has genus g > 0 then there are no functions / € C(X) of degree 1. (A hint 
for this exercise is at the end of the book.) 

6.1.4. In general, if a map ip is surjective this need not imply that a compo- 
sition ip o / is surjective as well. Yet the argument at the end of the section 
asserts that if a holomorphic homomorphism ip : Jo(^V) — > E surjects then 
so does (po f where / : X 0 (N) — ► Jo(N) is an embedding 14 Explain. 
(A hint for this exercise is at the end of the book.) 

6.2 Maps between Jacobians 

Let h : X — > Y be a nonconstant holomorphic map of compact Riemann 
surfaces. This section defines corresponding forward and reverse holomorphic 
homomorphisms of Jacobians, hj : Jac(A) — ► Jac(Y) and h J : Jac(Y) — > 
Jac(X). Since the Jacobian and the Picard group are isomorphic under Abel's 
Theorem, there also are homomorphisms hp and h p of Picard groups. The 
Jacobian maps hj and h J are defined in terms of transferring holomorphic 
differentials between X and Y and have interpretations in terms of transfer- 
ring integration between X and Y. The Picard group maps hp and h p come 
from transferring meromorphic functions between X and Y, independently of 
Abel's Theorem. In this chapter, where our methods are still complex analytic 
in keeping with the book so far, the Picard group and its maps don't play a 
large role, but from the next chapter onwards, when we move from complex 
analysis to algebraic geometry and begin working over fields other than C, 
the Picard group formulation is the one that will continue to make sense. 

To define the forward map of Jacobians in terms of differentials, begin 
with the pullback map induced by h, embedding the function field of Y in the 
function field of X, 

h* : C(Y) — > C(X), h*g = goh. 

Recall from Section 3.1 that each point x <G X has a ramification degree 
e x e Z+ such that h is locally e x -to-l at x. The orders of vanishing of a 
nonzero function and its pullback are related by 



v x {h*g) = e x v h{x) (g), g e C(Y)* 
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(Exercise 6.2.1(a)). Since ramification degree is the same thing as order in 
local coordinates, this says that the order of the composition is the product of 
the orders. In particular, h*g is holomorphic at x if g is holomorphic at h(x). 
The pullback extends to a linear map of holomorphic differentials, 

h* : QUY) — ► ^Li(X). 

To define this, let ipj : Uj — > Vj and tpj : Uj — > Vj be local coordinates 
on X and Y, where h(Uj) = Uj so that pjhipj 1 = hj : Vj — ► Vj, and let 
A be a holomorphic differential on Y. Then the pullback is defined locally in 
terms of the local pullback as 

(h*\)j = h*(\j) e nUVj), x 3 G Qi oX {Vj). 

The required compatibility condition tp^ j{{h*X)k\v k 3 ) = {h*^)jWj k from 
Section 3.3 follows from the known compatibility condition p* k j{Xk\y .) = 
Xj\ v (Exercise 6.2.1(b)). In coordinates, if Xj = g(q)dq then (h*X)j = 
(h*g)(q)h'j(q)dq. The pullback on differentials dualizes to a linear map of dual 
spaces, denoted h* rather than (h*) A , 

K-.nUX) A ^f2i ol (Y) A , h^ = V oh*. 

The forward change of variable formula says that for any path 7 in X, 

[h*X= f A, Xef2UY). (6.3) 

Showing this reduces to finitely many applications of the usual change of 
variable formula on coordinate patches. Especially if a is a loop in X and 

<P e ^oi(^) A is J a then G ^ ol (^) A ^ L h * = k{ a y Since h ^ is 
a loop in Y this shows that ft-* takes homology to homology and therefore 
descends to a map of Jacobians, a holomorphic homomorphism because it 
is induced by a linear map of the ambient complex spaces. This defines the 
forward map of Jacobians in terms of differentials. For any ip G f2^ oi (X) A let 
[ip] = ip + Hi(X, Z) denote its equivalence class in Jac(X), and similarly for Y. 

Definition 6.2.1. The forward map of Jacobians is the holomorphic homo- 
morphism induced by composition with the pullback, 

hj : Jac(X) — > Jac(F), hj[ip] = [h*<p] = [ipoh*]. 

Letting xq be a base point in X as before and writing elements of Jac(X) as 
sums of integrations as in Abel's Theorem, forward change of variable shows 
that the forward map of Jacobians transfers integration modulo homology 
from X to Y by pushing forward the limits of integration, 

fcj(E«x / ) = / • ( 6 - 4 ) 

x J xa x Jh(x 0 ) 
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For example, let X = C/A and Y = C/A' be complex elliptic curves, so 

that their embeddings in their Jacobians, z + Ai-t J 0+A and z + A'i-t J 0+A , , 
are isomorphisms. By Proposition 1.3.2 or its generalization Proposition 6.1.4 
any nonconstant holomorphic map between them takes the form 

h : X — >Y, h(z + A) = mz + b + A', 

where m,kC, m/0, and mA C A'. Thus by (6.4), 

M/ )=/ =/ • 

JO+A Jb+A' JO+A' 

This shows that under the isomorphisms between the complex elliptic curves 
and their Jacobians the map retains its linear part but drops its translation, 

hj : X — >Y, hj(z + A) = mz + A' . 

In particular, if h is an isogeny then hj = h. Let h : X — > Y be another 
isogeny. Then h + h : X — ► Y is an isogeny as well so long as h ^ —h, and so 

(h+h)j = h+h = hj + hj iih + h^O. (6.5) 

That is, taking forward maps of isogenics of complex elliptic curves is additive. 

Returning to the general situation h : X — > Y, there is also a natural 
forward map of Picard groups. The norm map transfers functions forward 
from C(X) to C(Y) (Exercise 6.2.2(a)), 

norim, : C(X) — ► C(Y), (norm, l /)(y) = J] f{x) e *. 

xeh- 1 {y) 

The orders of vanishing of a nonzero function and its norm are related by 
^(norm,,/) = ]T v x (f), f G C(X)* 

(Exercise 6.2.2(b)). Consequently 

div(norm h /) = ^ ^ u x (f) J y = ^ u x {f)h{x). 

That is, the norm takes ^2 v x{f) x to ^ x v x (f)h(x) at the level of principal 
divisors. The map on general divisors that extends this, 

h D : Divpf) — > Div(F), h D ( n x x^ = n x h(x), 

X X 

is a homomorphism taking degree-0 divisors to degree-0 divisors (Exer- 
cise 6.2.3), taking principal divisors to principal divisors since by its definition 
/i£)(div(/)) = div(normft/), and therefore descending to divisor classes. 
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Definition 6.2.2. The forward map of Picard groups is the homomorphism 
h P : Pic 0 '(X)—> Pic 0 \Y), h P [Y,n x x] = [^^(i)]. 

X X 

This map of Picard groups corresponds to the map (6.4) of Jacobians 
under the isomorphism of Abel's Theorem, 

Pic°(X) — > Jac(AT), [^n x x] ^^n x f , 

x x ^ x 0 

and similarly for Y using the base point y 0 = h(xo). That is, the following 
diagram commutes: 

Pic 0 (X)-^> Pic 0 (Y) 



Jac(A) — Jac(Y). 

We could have defined Zip from ftj via this diagram but then it would be 
dependent on complex analysis. Assuming the norm will continue to make 
sense in other contexts, we could have defined hj from hp via this diagram 
but doing so would lose the information that hj is holomorphic. 

Defining the reverse map h J in terms of differentials is more delicate than 
doing so for the forward map. Recall from Section 3.1 that h is a surjection of 
finite degree d, that h is locally e^-to-l at each iglas already mentioned 
here, and that the set of exceptional points in X is defined as £ — {x € X : 
e x > 1}, the finite set of points where h is ramified. Let Y' = Y — h(£) and 
X' = h~ 1 (Y') be the Riemann surfaces obtained by removing the images of 
the exceptional points from Y and their preimages from X. The restriction 
of h away from ramification and its image, 

h : X' — > Y', 

is a d-fold covering map where d — deg(/i). This means that every point y ef 
has a neighborhood U whose inverse image is a disjoint union of neighborhoods 
Ui, . . . , Ud in X' such that each restriction hi : Ui — ► U of h is invertible. 

A theorem from topology says that given a path S in Y' and a point 
x € h^ 1 (S(0)) in X', there exists a unique Zi/t 7 of (5 to X' starting at x, i.e., 
a unique path 7 in X' such that 7(0) = x and /i o 7 = <5. If <5 is a path in Y 
and only its endpoints might lie in h{£) then the Local Mapping Theorem of 
complex analysis shows that for each x € /i _1 (<5(0)) there exist e x lifts of 7 
starting at x, for a total of c? lifts 7 in A". If /3 is a loop in Y' then the map 
taking the initial point of each of its lifts to the terminal point is a permutation 
of the cZ-element set ft- _1 (/3(0)), and so the lifts concatenate to loops a in X' 
corresponding to the permutation's cycles. Any path in X can be perturbed 



6.2 Maps between Jacobians 221 



locally to avoid h(£) away from its endpoints, and by Cauchy's Theorem this 
doesn't affect integration of holomorphic differentials. Thus any path integral 
of holomorphic differentials on Y can be taken over a path 8 such that only 
its endpoints might lie in h{£). In particular, loop integrals in Y can always 
be taken in Y'. 

The trace map induced by h is a linear map transferring differentials 
from X to Y, 

tv h :Q{ ol (X)^Q{ oX (Y). 

If 8 is a path in Y 1 lifting to a path in X 1 and h~ x is a local inverse of h 
about 8(0) taking 8(0) to the initial point of the lift then h~ x has an analytic 
continuation along 8, the chaining together of overlapping local inverses cul- 
minating in the local inverse about 8(1) taking 8(1) to the terminal point of 
the lift. To define the trace, let uo e Q\ oi (X). Suppose y is a point in Y' , so 
that h has local inverses hj 1 : U — > Ui, i = 1, . . . , d. The trace is defined 
on U as the sum of local pullbacks, 

(tv h u>)\ d = J2(K 1 T("\u i ). 

This local definition pieces together to a well defined global trace on Y' be- 
cause analytically continuing the local inverses h^ 1 along any loop in Y' back 
to y permutes them, leaving the trace unaltered. The trace extends holomor- 
phically from Y' to all of Y (Exercise 6.2.4). The trace dualizes to a linear 
map of dual spaces, 

tr£ : VUY) a — ► VUX)\ tr£V - V> ° tr h . 
The reverse change of variable formula for any path 8 in Y 1 , 

f(h- 1 )*u= [ w, wenUx), 

JS Jh~ 1 o5 

is meaningful so long as h^ 1 is understood to be some local inverse of h 
at 8(0) analytically continued along 8, making h^ 1 o 8 the lift of 8 starting 
at h^ 1 (8(0)). This reduces to a finite sum of the same result on coordinate 
patches, where it is just forward change of variable (6.3) with h^ 1 in place 
of h. Summing the reverse change of variable formula over local inverses gives 
for paths 8 in Y' 

f tv h uj= ^ f co, coe QLx(X). (6.6) 

JS lifts 7 J ~t 

This formula extends continuously to paths 8 in Y such that only their end- 
points might lie in h(£). Especially if (3 is a loop in Y' and ip G S7^ ol (Y) A is J 
then tr^tp £ n^ ol (X) A is L tr/j = J^iifts 7 / 7- Since /3 lifts to a concatenation 
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of loops in X this shows that tr£ takes homology to homology and therefore 
descends to a map of Jacobians, again a holomorphic homomorphism because 
it is induced by a linear map of the ambient complex spaces. This defines the 
reverse map of Jacobians in terms of differentials. 

Definition 6.2.3. The reverse map of Jacobians is the holomorphic homo- 
morphism induced by composition with the trace, 

h J : Jac(F) — > Jac(X), h J [ij)] = [if> o tr h ]. 

Writing elements of Jac(F) as sums of integrations per Abel's Theorem, 
the summed reverse change of variable formula (6.6) shows that the reverse 
maps of Jacobians transfers integration modulo homology from Y to X by 
pulling back the limits of integration with suitable multiplicity, 

^ 7 (E%f) = E% E e *f- ( 6 - 7 ) 

y Jy ° y xeh-^y) Jx ° 

We return to the example where X = C/A and Y = C/A' are complex 
elliptic curves and h is a nonconstant holomorphic map, h(z + A) = m,z+b+A' 
with m ^ 0 and mA C A'. The map is unramified and has degree deg(h) = 
[m~ 1 A l : A]. For any i»eC the point w + A' £ Y has inverse image 

h-\w + A') = {m-^w -b) + t + A:te m^A'/A}. 

By (6.7), 

i-w+A' /.m -1 (w—b)+t+A r m~ 1 w+t+A r Y] m -1 w+t+A 

hJ ( ) = E/ =e/ =/ 

JO+A' t J-m- 1 b+A t JO+A JO+A 

and again under the isomorphisms X Jac(X) and Y Jac(F) this is 

h J (w + ^1') = E m ~ lw + t + A. 
t 

Following this by the forward map hj(z + A) = mz + A' from before shows 
that (hj o h J )(w + A') = [m _1 A' : A](w + A'). That is, the composition is 
multiplication by the degree of h on Y, 

hjoh J = [deg(h)}. 

Thus hj is the dual isogeny of h J , making h J the dual isogeny of hj in turn 
as shown in Section 1.3. Consequently if h is an isogeny and h : X — > Y is 
an isogeny as well, then using a result from earlier in the section and a result 
from Section 1.3 that taking duals of isogenics is additive, 
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(h+h) J = {h + h)j = hj + hj by (6.5) 



hj + hj by (1.7) 
h J + h J 



iih+h^O. 



This shows that taking reverse maps of isogenies of complex elliptic curves is 
additive as well. 

The result hj o h J = [deg(h)] is general: for any nonconstant holomorphic 
map h : X — ► Y of compact Riemann surfaces, pullback followed by trace 
multiplies differentials by the degree of the map (Exercise 6.2.5), 

(t r?( o/i*)(A)=deg(/i)A, Xefi^Y). (6.8) 

It follows that hj o h J is multiplication by deg(h) in Jac(Y). We will need 
another consequence in Section 6.6: 

Proposition 6.2.4. Let h : X — ► Y be a nonconstant holomorphic map of 
compact Riemann surfaces. Let h* = (h*) A |h 1 (x,z) ^ e the induced forward 
homomorphism of homology groups, 

K :Hi(X,Z) — ^H!(y,Z), K{Y,n a j) = $>« f . 

Then /i* (Hi (X, Z)) is a subgroup of finite index in Hi(Y, Z). LfV is a sub space 
of f2^ ol (Y) then the restriction h*(Ki(X, Z))|y is a subgroup of finite index 
in Hi(r,Z)| v . 

Proof. Using (6.8) at the last step, compute that 

^(H 1 (X,Z))D((^) A otr^)(H 1 (r,Z)) 

= (tr,, o fc*) A (Hi(Y; Z)) = deg(/ l )H 1 (F, Z). 

Therefore H\(Y, Z)/ft*(Hi(X, Z)) is a quotient of (Z/ deg(/i)Z) 2s . If V is 
a subspace of J7^ ol (F) then the restriction Hi(F, Z) — ► Hi(Y, Z)|y is a 
surjection, and (Exercise 6.2.6) [Hi(Y,Z)|y : h*(Hi(X, Z))|y] is at most 
[Hi(Y,Z) :ft.(Hi(X,Z))]. □ 



To complete the program of this section we need to describe the reverse 
map of Picard groups. This is easy. The pullback h* : C(Y) — >• C(X) 
transfers functions backwards. The earlier formula v x {h*g) = e x Vh(x){g) for 
g £ C(Y) gives 

div(h*g) = <z x v h{x) {g)x = ^ v y{g) Y exX - 

x V xeh- 1 (y) 

That is, the pullback takes J2 y v yi.9)V to J2 y v yi.9) Y, x eh- 1 ( y ) e * x at tne ^ eve ^ 
of principal divisors. The map on general divisors that extends this, 
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h D :Div(F) — ►Div(X), ^(E%y)=E n f E 

is again a homomorphism taking degree-0 divisors to degree-0 divisors (Exer- 
cise 6.2.7(a)), taking principal divisors to principal divisors since by its defi- 
nition h D (div(g)) = div(h*g), and therefore descending to divisor classes. 

Definition 6.2.5. The reverse map of Picard groups is the homomorphism 

h p :Pic°(y)^Pic°(X), h p [J2n y y] = [J2^v £ e * x ] ■ 

y y xeh- 1 {y) 

As with forward maps this correlates via Abel's Theorem to the descrip- 
tion (6.7) of h J so that a diagram commutes: 

Pic°(r) -^Pic°(A:) 

Jac(F) — JacpC). 

Again this means that we could have defined cither of h J and h p in terms of 
the other, but doing so would either leave h p dependent on complex analysis 
or lose the information that h J is holomorphic. It is easy to show that the 
composite hp ° h D multiplies by deg(h) in Div(F) (Exercise 6.2.7(b)), and so 
the result that h P oh p multiplies by deg(ft-) in Pic°(F) holds without reference 
to Jacobians. 

The compact Riemann surfaces in this book other than complex elliptic 
curves are the modular curves. The next section will discuss maps between 
their Jacobians as well. 

Returning to Versions Xc and Jc of the Modularity Theorem, we already 
know that Version Jc implies Version Xc- Conversely, let E be a complex 
elliptic curve with j(E) e Q and let h : Xq(N) — > E be the map guaran- 
teed by Version Xc- Then hj : Jo(-W) — ► Jac(S) is the necessary map for 
Version Jc, remembering that E is naturally isomorphic to its Jacobian. The 
map surjects since hj o h J is multiplication by deg(h), a surjection. 

The myriad functorial adornings of the letter h in this section (see Exer- 
cise 6.2.8) only serve as irritants once the ideas are clear. When we move from 
complex analysis to algebraic geometry in later chapters, forward and reverse 
maps will just be denoted /i* and h* . 

Exercises 

6.2.1. (a) Show that if h : X — ► Y is a nonconstant holomorphic map of 
compact Riemann surfaces and g G C(Y) then u x (h*g) = e x v h ^ x ){g) at any 
point x € X. 
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(b) Show that the pullback ft,* A of a holomorphic differential A on Y sat- 
isfies the compatibility condition on X. 

6.2.2. (a) Show that the norm function defined in the section is meromorphic. 
(A hint for this exercise is at the end of the book.) 

(b) Show that if / e C(X) then v y (norm h f) = J2 x eh- 1 ( y ) v *(f) at am/ 
point y GY. 

6.2.3. Show that the map ho is a homomorphism and takes degree-0 divisors 
to degree-0 divisors. 

6.2.4. This exercise extends the trace holomorphically from Y' to all of Y. 
Part (a) gives an intrinsic argument and part (b) gives a lower-powered argu- 
ment using local coordinates. Fill in details. 

(a) Every path 6 in Y' has d lifts 7 in X' . By definition of the trace, 
J s ti h oj = J2 7 J w for w e f2l ol (X). Let U C Y be homeomorphic to a disk 
and contain only one point y € h(£), and let U' = U — {y} C Y'. Any loop 
(3 in U' lifts to fewer than d loops a in h~ l (U). Each connected component 
of h (U) is homeomorphic to a disk and so Ltr^cj = ^ a | a a; = 0. This 
shows that integrating tr/,w is path-independent in {/'. Let w(y) = Jjtr^(cj) 

where y is any point in U' . Then w is holomorphic on {/' and dw = tr h uj. The 
expression w(y) = J2-y f w shows that w is bounded on U' and hence extends 

holomorphically to U. Therefore so docs tr^. 

(b) Let y G h{£) be the image of an exceptional point. Suppose first that 
the ramification of h over y is total, i.e., h^ 1 (y) is a single point x with 

= deg(h). In suitable local coordinates the map about x is r = h(q) = q e 
(where e = e x ) on some disk V about q = 0. For any nonzero tq € h(V) let 

( ) 1/e : {r:\r- r 0 \<\r Q \}^V 

denote a holomorphic branch of the eth root function so that the local inverses 
of h about r 0 are all of the branches, h^ 1 = fj, z e ( ) x / e for i = 0, . . . , e — 1. Let 
u) = f(q)dq on V. Then the trace on h(V)\{0} is 

e— 1 1 e— 1 

i=0 i=0 

If / has power series expansion /(g) = X^^Lo a n^™ then rearranging a double 
sum shows that locally 

00 

tr fe w = a (TO+ i )e _ir m c{r, 

m=0 

and this is holomorphic at r = 0. In the general case where h~ 1 {y) contains 
more than one point x, each with its ramification degree e x , carrying out this 
process at each x and summing the results shows that tr/jW extends holomor- 
phically to y. 



226 6 Jacobians and Abelian Varieties 



6.2.5. Prove formula (6.8). 

6.2.6. Let 7T : G — > n(G) be a group homomorphism and let H be a subgroup 
of G. Show that [tt(G) : ir(H)] < [G : H]. 

6.2.7. (a) Show that the map hP is a homomorphism and takes degree-0 
divisors to degree-0 divisors. 

(b) Show that the composition hp, o h D multiplies by deg(h) in Div(y). 

6.2.8. Briefly summarize the relations among h, h*, h* , norm/j, tr^, hj, h J , 
hp,, hP , hp, and h p . 

6.3 Modular Jacobians and Hecke operators 

The double coset operators from the beginning of Chapter 5 lead naturally 
to maps between Jacobians of modular curves. In particular, letting Ji(-/V) 
denote the Jacobian associated to the group ri(JV), 

Ji(JV) = Jac(Xi(JV)), 

the Hecke operators act naturally on Ji(N). This section explains these ideas. 

Recall the double coset operator from Section 5.1: Let A and A be con- 
gruence subgroups of SL2(Z) and let a € GL^(Q). Set A = a^ 1 T\a n A, 
and take representatives {^2,3} such that A\A = UjA72,j- Then {(3j} = 
{«72,i} arc in turn representatives such that A £2/2 = Uj A/^j- Also letting 
A = aF 3 a _1 = A H ar 2 or x gives the configuration 

r 2 i a r 3 ► a 

where the group isomorphism is 7 1— > o^a -1 and the other arrows are in- 
clusions. The groups have modular curves X\, X 2j X 3 , and X' 3 , and their 
corresponding configuration is 

x 2 ^x 3 ^^x 3 ^x 1 

where the modular curve isomorphism is A T i->- r 3 a(r), denoted a. Each 
point of X 2 is taken back by m o a o -k^ 1 to a multiset of points of X\, 

r 2 r ^ {r 372j (r)} {r^-(r)} {A^-(t)}, 

where tt^" 1 takes a point to its overlying points, each with multiplicity accord- 
ing to its ramification degree, 7r 2 _1 (x) = {e y ■ y : y e -X'3, TT 2 (y) — x}. This 
leads to an interpretation of the double coset operator as a reverse map of 
divisor groups, 
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[Aar 2 ] 2 : Div(X 2 ) — > Div(Xi), 

the Z-linear extension of the map F 2 t i-> . ri/3j(r) from Jf 2 to Div(Xi). 
Using the language of the previous section, we now recognize all of this as 
saying that the double coset operator on divisor groups is a composition of 
forward and reversed induced maps, [Fiar 2 ] 2 = (tti) d o a D o n®- It therefore 
descends to the corresponding map of Picard groups, denoted by the same 
symbol, 

[r ia r 2 } 2 = (tti)p o a P o tt 2 p : Pic°(X 2 ) — > Pic 0 ^), 

given by 

[Aar 2 ] 2 [^n r r 2 r] = [£n r ^AftW]. 

T T j 

To rephrase this in terms of Jacobians and modular forms, let r be a 
congruence subgroup of SL 2 (Z). By Section 3.3 the holomorphic differentials 
f2fr ol (X(r)) and the weight 2 cusp forms S 2 (r) are naturally identified — every 
cusp form / describes a holomorphic differential (essentially /(r)dr but 
suitably defined on X(r)) and every holomorphic differential takes the form 
ui(f) for some cusp form /. That is, uj : S 2 (r) — ► f2^ ol (X(r)) is a linear 
isomorphism. The dual spaces are consequently identified as well under w A , 

s 2 (rr=^(QUx(nr). 

In the modular context, let H\(X(r), Z) denote the corresponding subgroup 
of S 2 (r) A , what would be denoted oj A (Ri(X(r) 7 Z)) in the language of dif- 
ferentials. It is natural to redefine the Jacobian of X(r) as a quotient of the 
dual space of weight 2 cusp forms, 

Definition 6.3.1. Let r be a congruence subgroup o/SL 2 (Z). The Jacobian 
of the corresponding modular curve X(r) is 

jac(x(r)) = 5 2 (r) A /H 1 (x(r),z). 

Under this change of terminology the induced maps of the previous section 
need to be rephrased in terms of functions rather than differentials. Let X 
and Y be the modular curves associated to congruence subgroups i~x and Fy 
of SL 2 (Z). Let a £ GL^(Q) be such that ctFxoT 1 C Fy, and consider the 
corresponding holomorphic map, 

h:X — >Y, h(r X T) = r Y a( T ). (6.9) 

The weight-2 operator on functions is compatible with the pullback on differ- 
entials in the sense that the following diagram commutes (Exercise 6.3.1(a)): 
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s 2 (r Y )^Us 2 (r x ) 



The induced forward map on dual spaces correspondingly becomes 

K :S 2 (r x ) A ~^S 2 (r Y ) A 1 h.<p = <po[a] 2 . 

Similarly, if ar x a~ l \r Y = \Jj &r x a ^ Y j then the following diagram com- 
mutes (Exercise 6.3.1(b)): 



(6.10) 



s 2 {r x f-^^ s 2 {r Y ) 



(6.11) 



Letting tr^ also denote the top map, the induced reverse map on dual spaces 
is thus 

tr£ : S 2 (/V) A — ► S 2 {r x )\ tr£V - V> ° 

3 

By the compatibilities shown here, ft.* and tr^ descend to maps of Jacobians, 
denoted hj and as before. 

Recall from Section 5.1 that the double coset operator on divisor groups 
corresponds to the double coset operator on modular forms, in our case of 
weight 2 cusps forms 

[A«r 2 ] 2 : ^(A) — ► s 2 (r 2 ), f[r ia r 2 ] 2 = ]T f[p 3 ] 2 . 

j 

Its pullback is denoted by the same symbol, 

[r ia r 2 } 2 : s 2 (r 2 ) A — ► s 2 (a) a , [Aar 2 ] 2 / - /[Aar 2 ] 2 . 

As with the divisor map, we now recognize this as a composition of induced 
maps, [fiar 2 ] 2 = (7Ti)* oa, otr^ 2 , and this operator correspondingly descends 
to a composition of maps of Jacobians. That is, the double coset operator on 
Jacobians is 

[AoA^ = (TTl) J ° Oj O 7T 2 : J ac (^2) ► Jac(A"!), 

where again letting brackets denote equivalence classes modulo homology, 

[r ia r 2 ]M = [4, o [r ia r 2 ] 2 ] for v e 5 2 (r 2 ) A . 
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To summarize, the double coset operator acts on Jacobians as composition 
with its action on modular forms in the other direction. 

The Hecke operators are special cases of the double coset operator. 
For p prime the Hecke operator T p on S 2 (A(TV)) is [A (TV) A(TV)] 2 , 
and similarly for Jo (TV). For d relatively prime to TV the Hecke operator 
(d) on S 2 {r 1 {N)) is [A(TV)aA(TV)] 2 for any a = e A(TV) with 

S = d (mod TV). This proves 

Proposition 6.3.2. The Hecke operators T = T p and T = (d) act by compo- 
sition on the Jacobian associated to A (TV), 

T : Jx(/V) — ► Ji(JV), M i— > [<y9 o T] /or <^ e S 2 (A(TV)) A , 

and similarly for T p on Jo(TV). 

Recall from Section 1.5 and Section 5.2 that when the double coset oper- 
ator is specialized to T p the group A becomes r®(N,p) = A (TV) H r°(p), 
its modular curve X 3 is denoted X®(N,p), and the maps 7r 2 and 7Ti o a 
from X? (TV, p) to X x (TV) are 

^ 2 (A°(TV, P )t) = A(TV)t, (tti o a)(J?(JV,p)r) - A(JV)(t/ P ). 

The discussion here thus describes A explicitly as a pullback followed by a 
pushforward, 

T p = (7Ti o a) j o 7r 2 : Ji(TV) — >• Ji(TV). 

Similarly for the diamond operator, letting any a e A (TV) also denote the 
self-map A(TV)r k> A(TV)a(T) of Xt(N), 

(d) = aj : Ji(TV) — ► Jr(TV), a = [« e A(TV), <S = d (mod TV). 

Again as in Section 5.1, when A T3 A and a = J, the maps of curves in 
the double coset operator configuration collapse to 

7r : X 2 — > Xi. 

The discussion earlier shows that the forward map on dual spaces is restriction, 

^ : 5 2 (A) A — > 5 2 (A) A , n*<p - ^Is^ro- 
This descends to Jacobians, 

7Tj : Jac(Jf 2 ) — >• Jac(Xi), irj[ip] = [<£|s 2 (A)]- 

Especially when A = A (TV) and A = A (TV) this shows that restriction 
induces a natural surjection from Ji(TV) to Jo (TV). We will cite this fact at 
the end of Section 6.6. 
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Exercise 



6.3.1. (a) Show that diagram (6.10) commutes. (A hint for this exercise is at 
the end of the book.) 

(b) Show that diagram (6.11) commutes. 

6.4 Algebraic numbers and algebraic integers 

To prepare for applying the results of the previous section, this section sketches 
the requisite basics of algebraic number theory for the reader who is not 
familiar with them. 

A complex number a is an algebraic number if a satisfies some monic 
polynomial with rational coefficients, 

p(a) = 0, p(x) = x n + cix™ -1 H hc„, ci,...,c„ e Q. 

Every rational number r is algebraic since it satisfies the polynomial x — r, 
but not every algebraic number is rational, for example cither complex square 
root of a rational number r satisfies the polynomial x 2 — r. Complex numbers 
expressible over Q in radicals arc algebraic, for example /i n — e 2, ™/ n , but the 
converse is not true. 

The algebraic numbers form a field, denoted Q. This is shown as follows. 

Theorem 6.4.1. Let a be a complex number. The following conditions on a 
are equivalent: 

(1) a is an algebraic number, i.e., a G Q. 

(2) The ring Q[a] is a finite- dimensional vector space over Q, 

(3) a belongs to a ring R in C that is a finite- dimensional vector space overQ. 

Proof. (1) (2): Let a satisfy the polynomial x n + t^x" -1 + ■ ■ ■ + c n 

with a, . . . , c„ € Q. Then a n = — J27=o Cn-ioi 1 , so the complex vector space 
generated by the powers {1, a, ... , a" -1 } also contains a n . Similarly a n+1 = 
— Y^i=v c n-i al+1 , showing that a n+1 is in the space, and so on by induction 
for all higher powers of a. 

(2) (3) is immediate. 

(3) (1): Let the ring R have basis gi,...,g n as a vector space 
over Q. Then multiplying each gi by a gives a rational linear combination 
of the generators, agi = X)J=i c ij3i- Letting g denote the column vector 
with entries <?i, this means that ag — Mg where M is the n-by-n rational 
matrix with entries Cij. Thus a is an eigenvalue of M, meaning it satisfies the 
characteristic polynomial of M, a monic polynomial with rational coefficients. 



□ 



Condition (3) in the theorem easily proves 
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Corollary 6.4.2. The algebraic numbers Q form a field. 

Proof. Let a and [3 be algebraic numbers. Then the rings Q[a] and Q[/3] 
have respective bases {a 4 : 0 < i < m} and {ft : 0 < j < n} as vector 
spaces over Q. Let i? = Q[a,/3], spanned as a vector space over Q by the 
set {a 1 ft : 0 < i < to, 0 < j < to}. Then a + (3 and af3 belong to R, 
making them algebraic numbers by condition (3) of the theorem. If a =/= 0 
then its polynomial p(x) can be taken to have a nonzero constant term c n 
after dividing through by its lowest power of x. The relation p(a) = 0 is 
a^ 1 = (p(a) — c n )/(—c n a) G Q[a], making or 1 an algebraic number by 
condition (3) as well. □ 

If a and (3 are algebraic numbers satisfying the monic rational polynomials 
p(x) and q(x) then the proofs of Corollary 6.4.2 and of (3) => (1) in 
Theorem 6.4.1 combine to produce the polynomials satisfied by a + [3 and a(3 
and 1/a if a ^ 0. The theory of resultants provides an efficient algorithm to 
find these polynomials (see Exercise 6.4.1). 

One can now consider complex numbers a satisfying monic polynomials 
with coefficients in Q. But in fact Q is algebraically closed, meaning any such a 
is already in Q. The proof again uses condition (3) in the theorem. 

Corollary 6.4.3. The field Q of algebraic numbers is algebraically closed. 

Proof. (Sketch.) Consider a monic polynomial x n + c\x n ~ x + ■ ■ ■ + c n with 
coefficients a € Q and let a be one of its roots. Since each ring Q[ci] is a 
finite-dimensional vector space over Q, so is the ring R = Q[ci , . . . , c n ]. Let 
R' = R[a\. If {vi : 1 < i < m} is a basis for R over Q then 



is a spanning set for R' as a vector space over Q. (This set is not necessarily 
a basis since a might satisfy a polynomial of lower degree.) Now condition (3) 



Definition 6.4.4. A number field is a field K C Q such that the degree 
[K : Q] = diniQ(K) is finite. 

The ring of integers Z in the rational number field Q has a natural analog 
in the field of algebraic numbers Q. A complex number a is an algebraic integer 
if a satisfies some monic polynomial with integer coefficients. The algebraic 
integers are denoted Z. The algebraic integers in the rational number field Q 
are the usual integers Z (Exercise 6.4.3), now called the rational integers. 
Every algebraic number takes the form of an algebraic integer divided by a 
rational integer (Exercise 6.4.4). Similarly to Theorem 6.4.1 and its corollaries 
(Exercise 6.4.5), 

Theorem 6.4.5. Let a be a complex number. The following conditions on a 
are equivalent: 



{viCt^ : 1 < i < to, 0 < j < n} 



of the theorem shows that a € Q. 



□ 
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(1) a is an algebraic integer, i.e., a € Z, 

(2) The ring Z[a] is finitely generated as an Abelian group, 

(3) a belongs to a ring R in C that is finitely generated as an Abelian group. 

Corollary 6.4.6. The algebraic integers Z form a ring. 

Corollary 6.4.7. The algebraic integers form an integrally closed ring, 
meaning that every monic polynomial with coefficients in Z factors down to 
linear terms over Z, i.e., its roots lie in Z. 

Definition 6.4.8. Let K be a number field. The number ring of K is the 

ring of algebraic integers in K, 

Ok = ZHK. 

We need a few more results from algebraic number theory, stated here 
without proof. Every number ring Ok is a free Abelian group of rank [K : Q] . 
That is, letting d = [K : Q], there is a basis {ot\, . . . , a d } C Ok such that Ok 
is the free Abelian group 

O k = Zai 8 • •• 8 Za d . 

There exist d nonzero field homomorphisms from K to C. These have triv- 
ial kernel, making them embeddings a\ ,<?,} : K C, and they fix the 

rational field Q clcmcntwise. The d-by-d matrix 

A= [a?] = 

has nonzero determinant. 

Every embedding a : K C extends (in many ways) to an automorphism 
a : C — ► C. Conversely every such automorphism restricts to an automor- 
phism a : Q — > Q of the algebraic numbers that takes the algebraic integers 
Z to Z, and this automorphism further restricts to an embedding of K in Q 
that injects Ok into Z. The only algebraic numbers that are fixed under all 
automorphisms of C are the rational numbers Q. After this we will be casual 
about distinguishing between maps a as embeddings of K, as automorphisms 
of C, or as automorphisms of Q. 

Exercises 

6.4.1. Recall from algebra that if f(x) = aox m + aix n_1 + • • • + a m and 
g(x) = b 0 x n + bix 11 ^ 1 + ■■■ + £>„ then their resultant R(f(x),g(x),x) is the 
determinant of their Sylvester matrix 



a. 



a" 
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M 



a 0 oi 



a 0 Oi ... 
6 0 61 • • • b n 
bo h ■ ■■ b n 



bo h 



(n staggered rows of a^s, m staggered rows of bj's, all other entries 0). The 
resultant eliminates x, leaving a polynomial in the coefficients that vanishes 
if and only if / and g share a root. Let p(x) and q(x) be monic polynomials 
in Q[x]. Consider the resultants 

q(t, u) = R{p(s),u -s-t;s), r(u) = R(q(t, u),q(t);t). 

Show that if a and (3 satisfy the polynomials p and q then a + (3 satisfies the 
polynomial r. Similarly, find polynomials satisfied by a/3 and 1/a if a ^ 0. 
(A hint for this exercise is at the end of the book.) 

6.4.2. Use the methods of the section or of Exercise 6.4.1 to find a monic 
integer polynomial satisfied by y/2 + ^3. 

6.4.3. Show that Z n Q = Z. (A hint for this exercise is at the end of the 
book.) 

6.4.4. Show that every algebraic number takes the form of an algebraic integer 
divided by a rational integer. (A hint for this exercise is at the end of the book.) 



6.4.5. Prove Theorem 6.4.5 and its corollaries. 



6.5 Algebraic eigenvalues 

Returning to the material of Section 6.3, recall the action of the weight-2 
Hccke operators T = T p and T = (d) on the dual space as composition from 
the right, 

t ■. s 2 (r 1 (N)) A ^s 2 (r 1 (N)) A , ^ r r, 

and recall that the action descends to the quotient Ji(AT). Thus the oper- 
ators act as endomorphisms on the kernel H\(Xi(N), Z), a finitely gener- 
ated Abelian group. In particular the characteristic polynomial f(x) of T p 
acting on Hi(Xi(N), Z) has integer coefficients, and being a characteristic 
polynomial it is monic. Since an operator satisfies its characteristic poly- 
nomial, f(T p ) = 0 on Hipf^A^Z). Since T p is C-lincar, also f(T p ) = 0 
on S 2 (Ti(A0) A and so f(T p ) = 0 on S 2 (r 1 (N)). Therefore the characteris- 
tic polynomial of T p on £2 (A. (AO) divides f(x) and the eigenvalues of T p 
satisfy f(x), making them algebraic integers. Since p is arbitrary this proves 
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Theorem 6.5.1. Let f G <S 2 (A(A0) be a normalized eigenform for the Hecke 
operators T p . Then the eigenvalues a n (f) are algebraic integers. 

To refine this result we need to view the Hecke operators as lying within 
an algebraic structure, not merely as a set. 

Definition 6.5.2. The Hecke algebra over Z is the algebra of endomor- 
phisms of <S 2 (A(A0) generated over Z by the Hecke operators, 

T z = Z[{T n ,<n):neZ+}]. 

The Hecke algebra Tc over C is defined similarly. 

Each level has its own Hecke algebra, but N is omitted from the notation 
since it is usually written somewhere nearby. Clearly any / G S2(r±(N)) is an 
eigenform for all of Tc if and only if / is an eigenform for all Hecke operators 
T p and (d). 

For the remainder of this chapter the methods will shift to working with 
algebraic structure rather than thinking about objects such as Hecke oper- 
ators one at a time. In particular modules will figure prominently, and so 
in this context Abelian groups will often be called Z-modules. For example, 
viewing the Z-modulc Tz as a ring of endomorphisms of the finitely generated 
free Z-module Hi {X\ (N) , Z) shows that it is finitely generated as well (Ex- 
ercise 6.5.1). Again letting /(r) = X^^Li a n(f)q n be a normalized eigenform, 
the homomorphism 

A/:T Z ^C, Tf=X f (T)f 

therefore has as its image a finitely generated Z-modulc. Since the image 
is Z[{a n (f) : n e Z+}] this shows that even though there are infinitely many 
eigenvalues a n (f), the ring they generate has finite rank as a Z-module. More 
specifically, letting 

If = ker(A/) = {T e T z : Tf = 0} 

gives a ring and Z-module isomorphism (Exercise 6.5.2) 

Tz/// ^ Z[{a„(/)}]. (6.12) 

The image ring sits inside some finite-degree extension field of Q, i.e., a number 
field. The rank of Tz/// is the degree of this number field as an extension 
of Q. 

Definition 6.5.3. Let f e <S 2 (A(A0) be a normalized eigenform, /(r) = 
X^^Li a n<z" • The field K/ = Q({a„}) generated by the Fourier coefficients 
of f is called the number field of f. 
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The reader is referred to William Stein's web site [Ste] and John Cremona's 
tables [Cre97] for examples. 

Any embedding a : Kj <^-» C conjugates / by acting on its coefficients. 
That is, if /(r) = X^^Li a nQ n then notating the action with a superscript, 

oo 

r(r) = 5>^". 

n=l 

In fact this action produces another eigenform. 

Theorem 6.5.4. Let f be a weight 2 normalized eigenform of the Hecke oper- 
ators, so that f £ S 2 (N,x) for some N andx- LetKf be its number field. For 
any embedding a : K/ C the conjugated f a is also a normalized eigenform 
in S2{N 1 x' 7 ) where X a { n ) = x{ n Y ■ If f * s a newform then so is f a . 

The proof will require two beginning results from commutative algebra, so 
these are stated first. 

Proposition 6.5.5 (Nakayama's Lemma). Suppose that A is a commu- 
tative ring with unit and J £ A is an ideal contained in every maximal ideal 
of A, and suppose that M is a finitely generated A-module such that JM = M. 
Then M = {0}. 

Proof. Suppose that M ^ {0} and let mi, . . . , m n be a minimal set of gen- 
erators for M over A. Since JM — M, in particular m n £ JM, giving it the 
form m n = a\m\ + • • • + a n m n with all £ J. Thus 

(1 - a n )m n = aimi H h a„_im„_i. 

But 1 — a„ is invcrtible in A, else it sits in a maximal ideal, which necessarily 
contains a n as well and therefore is all of A, impossible. Thus mi, . . . , m„_i 
is a smaller generating set, and this contradiction proves the lemma. □ 

Again suppose that A is a commutative ring with unit and J C A is an 
ideal, and suppose that M is an A-module and a finite-dimensional vector 
space over some field k. The dual space M A — Honik(M, k) is an A-module 
in the natural way, atp = ip o a for a £ A and ip £ M A (i.e., (a(p)(m) = 
ip(am) for m £ M), and similarly for (M/JM) A . Let M[J] be the elements 
of M annihilated by J, and similarly for M A [J]. Then there exist natural 
isomorphisms of A- modules (Exercise 6.5.3) 

(M/JM) A = M A [J], M A /JM A = M[J] A . (6.13) 

Now we can prove Theorem 6.5.4. 

Proof. The Fourier coefficients {a n } are a system of eigenvalues for the opera- 
tors T„ acting on <S 2 (A(A0)- We need to show that the conjugated coefficients 
{a^} are again a system of eigenvalues. 
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As explained at the beginning of the section, the action of Tz on 52 (A (N)) 
transfers to the dual space <S 2 (-Ti(-A0) A as composition on the right and 
then descends to the Jacobian Ji(N), inducing an action on the homol- 
ogy group Hi(Xi(N), Z). This Z-module is free of rank 2g where g is the 
genus of Xi(N) and the dimension of <S 2 (A (N)). Take a homology basis 
{<pi, . . . , <p 2g } C S 2 (A(A0) A , so that 

Hi(Xi(ao,z) = zipt e • • • e z^ 2s . 

With respect to this basis, each group clement J2 2 jLi n j ( Pj 1S represented as 
an integral row vector v — [rij] £ Z 29 and each T £ Tz is represented by an 
integral 2g-by-2g matrix [T] <G M 2g (Z), so that the action of T as composition 
from the right is multiplication by [T], 

T:v^tv[T\. (6.14) 

This action of T z extends linearly to the free C-module generated by the set 
{ipi, . . . , (fi2g}, the 2g-dimensional complex vector space 

v = evi e • • • e Ctp 2g - 

Each element (zi^i, • ■ • , Z2 g f2g) is represented by a complex row vector v = 
[zj] e C 2g , and the action of each T is still described by (6.14). Suppose 
{A(T) : T G T z } is a system of eigenvalues of T z on V, i.e., some nonzero 
v e C 2g satisfies 

v[T] = X(T)v, T e T Z - 

Let a : C — > C be any automorphism extending the given embedding 
<j : <—} C. Then cr acts on v clcmentwise and fixes the elements of each 
matrix [T] since it fixes Q. Thus 

v°[T\ = (v[T]) a = (\(T)v) a = A(T) CT if T , T € T z , 

showing that if {A(T) : T e Tz} is a system of eigenvalues on V then so is 
{A(T) CT : T e Tz}. To prove the theorem, this result needs to be transferred 
from V to S 2 (r 1 (N)). 

For convenience, abbreviate <S 2 (A(-^0) *° ^2 for the duration of this proof. 
The space S 2 is isomorphic as a complex vector space to its dual space 

5 2 A = + ■ ■ ■ + C V2g - 

The dual space is not V but a g-dimensional quotient of V under the map 
( z\ ipi, Z2 g f2g) ^ Zjtfij. The map has a g-dimcnsional kernel because 
the {<fij} are linearly independent over R but dependent over C. The proof 
will construct a complementary space 5^ isomorphic to 5 2 such that V is 
isomorphic to the direct sum S 2 ©<?^ as a Tz-modulc and therefore the desired 
result transfers to the sum, and then the proof will show that the systems of 
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eigenvalues on the sum are the systems of eigenvalues on S 2 , transferring the 
result to £2 as necessary. 

To construct the complementary space, recall the operator wn — [[ _jv o]h 
from Section 5.5, satisfying the relation wjyT — T*wn for all T £ Tz where as 
usual T* is the adjoint of T. For any cusp form g £ S 2 consider an associated 
map ip g from cusp forms to scalars, 

ip g :S 2 — > C, ip g (h) = (w N g, h). 

Then tp g (h + h) = ip g {h) + tp g (h) but since the Petersson inner product is 
conjugate-linear in its second factor, 

ip g (zh) = zip g (h), z £ C. 

Thus tf) g belongs to the set 5^ of conjugate-linear functions on 52, the complex 
conjugates of the dual space S 2 (Exercise 6.5.4). This set forms a complex 
vector space with the obvious operations. It is immediate that ip g + g = ip g + 4'g 
and vp zg = zip g for g,g £ 5 2 and z £ C, and therefore the map 

is C-lincar. It has trivial kernel, making it an isomorphism. The vector space 
S2 is a Tz-module with the Hecke operators acting from the right as compo- 
sition, and the linear isomorphism \P is also Tz-linear since 

^j Tg {h) = (w N Tg, h) = (T*w N g, h) = (w N g, Th) = (V> 9 ° T){h). 

That is, 52 and 5^ are isomorphic as complex vector spaces and as Tz- 
modules. In particular, every system of eigenvalues {A(T) : T £ Tz} on 52 is 
a system of eigenvalues on 5^ and conversely. 

Also, every system of eigenvalues on 52 is a system of eigenvalues on 
the dual space S£ and conversely. To see this, let / £ S 2 be a normalized 
eigenform. Similarly to before there is a map 

A /: T C ^C, Tf = X f (T). 

(We need the complex Hecke algebra Tc in this paragraph.) Let J/ = 
ker(A/) = {T £ Tc : Tf = 0}, a prime ideal of Tc- An application of 
Nakayama's Lemma shows that J/52 7^ 52 (Exercise 6.5.5), making the quo- 
tient S 2 / JfS 2 nontrivial. It follows that the subspace of the dual space anni- 
hilated by Jf, 

S 2 [Jf] = W e 5 2 A : <p o T = 0 for all T £ Jf}, 

is nonzero since it is isomorphic to (5 2 /J/5 2 ) A by the first isomorphism 
in (6.13). Since T x is the identity operator, T — Xf(T)Ti £ J f for any T £ T c , 
and so any nonzero ip £ S 2 [Jf] satisfies 
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pT = p(T- A/(T)Ti) + A/(2> = \f(T)<p, T e T c . 

Restricting our attention to Tz again, this shows that {A/(T) : T e Tz} is a 
system of eigenvalues on <S^ as claimed. The converse follows by replacing <S 2 
and S£ with their duals, since the finite-dimensional vector space <S 2 is natu- 
rally isomorphic to its double dual as a T z -module. Thus the cusp forms S 2 
and the sum S£ © S£ have the same systems of eigenvalues. 
Consider the C-linear map 

V ^>S£®S£, (z 1 ip 1 ,...,Z 2g <P2g) l-> (JZW.JZ 2 ^')- 



This is also a Tz-module map since ifj o T — ipj o T. The map has trivial 
kernel since if z j<Pj = 0 in <S^ and z j'fj — 0 m $2 * nen both Zj'-Pj = 0 
and "}2zj(pj = 0 in 5 2 , i.e., Re(-z,-)y?j = 0 and Y] lm(zj)ipj = 0 in <S>^; but 
the {(fij} are linearly independent over R, so this implies Zj = 0 for all j. 
Since the domain and codomain have the same dimension the map is a linear 
isomorphism of Tz-modulcs. The result that if {A(T) : T € Tz} is a system 
of eigenvalues then so is {X(T) a : T <E T z } now transfers from V to S£ © S£ 
and then to S 2 - Thus if /(r) = a n<z" is a normalized eigenform in S 2 (N, x) 
then its conjugate / ct (t) = «^<z" is a normalized eigenform in 5 2 (iV, x CT ) as 
desired. 

It remains to prove the last statement of the theorem, that if / is a newform 
then so is f a . By Theorem 5.8.3, f a takes the form / CT (r) = YJi a i/i( n i T ) 
where each /j is a newform at level Mi with njMj | N. (Note that this uses 
only the part of that theorem that we have proved, that the set of such /, 
spans <S 2 (-Ti(./V)).) Let t = a^ 1 : C — > C, an extension of another embedding 

r : K f C. Then / = (/ CT ) T = T,i a I fi ( n i T ) ■ If / CT is not ncw thcn b y 
Exercise 5.8.4 it is old and all Mi are strictly less than N. Since each fl is 
also a modular form at level Mj this shows that / is old as well. The result 
follows by contraposition. □ 

Linearly combining the normalized eigenforms gives modular forms with 
coefficients in Z. 

Corollary 6.5.6. The space <S 2 (-Ti(A0) has a basis of forms with rational 
integer coefficients. 

Proof. Let / be any newform at level M where M | N. Let K = K/ be the 
number field of /. Let {ai, . . . , ay} be a basis of Ok as a Z-module and let 
{cti, . . . , a d\ be the embeddings of K into C. Consider the matrix from the 
end of the previous section and the vector 





"of • 


■ al d ~ 




T 1 ' 


A = 






, f = 








a d . 
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and let g = Af, i.e., 

d 

'/, »- T r- ' 1 '<■ 

3 = 1 

Then span({gi, . . . ,gd}) — span({/ CTl , . . . , / CTti }) since A is invertible. Each gi 
takes the form gi(r) = ^2 n a n (gi)q n with all a n {gi) <G Z. For any automor- 
phism a : C — > C, as a 3 - runs through the embeddings of K/ into C so 
does Oj<j (composing left to right), and so 

d 
3 = 1 

That is, each a n (gi) is fixed by all automorphisms of C, showing that 
each a n (gi) lies in Z n Q — Z. Repeating this argument for each newform / 
whose level divides N gives the result. □ 

Exercises 

6.5.1. Let M be a free Z-modulc of rank r. Show that the ring of endomor- 
phisms of M is a free Z-modulc of rank r 2 , and so any subring is a free 
Z-module of finite rank. 

6.5.2. Let / e 5 2 (A(Ar)) be a normalized eigenform. Thus / e S 2 {N,x) 
for some Dirichlet character x : (Z/A^Z)* — ► C* and A/((d)) = x(^) for all 
d G (Z/iVZ)*. Show that there is a ring and Z-modulc isomorphism Tz/If — > 
Z[{o n (/),x(d)}]. Show that adjoining the x(^) values is redundant, making 
(6.12) in the text correct. (A hint for this exercise is at the end of the book.) 

6.5.3. Prove the isomorphisms (6.13). (A hint for this exercise is at the end 
of the book.) 

6.5.4. Let V be any complex vector space with dual space V A . Show that 
the set V A — {Cp : ip e V A } is the set of functions tp : V — > C such that 
ip(v + v') = il>(v) + ip(v') and tp(zv) = zip(v) for all v, v' e V and zeC. 

6.5.5. Let Jf, T c , and S 2 be as in the proof of Theorem 6.5.4. Show that 
Jf is a prime ideal. Define the local ring of Tc at Jf as a set of equivalence 
classes of formal elements 

A = {T/U : T e T c , U e T c - J f }/ ~ 

where the equivalence relation is 



T/U^T'/U 1 if V(f/'T — f/T') = 0 for some nonzero V e Tc - J/. 
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Show that "~" is indeed an equivalence relation and that consequently the 
local ring is a ring under the usual rules for manipulating fractions, 

T/U + T'/U' = (U'T+UT')/(UU'), (T/U) ■ (T' /U') = (TT')/(UU'), 

taken at the level of equivalence classes. Show that the ideal J = JfA is the 
unique maximal ideal in the localization. Similarly define the local module 
of S2 at Jf as a set of equivalence classes for formal elements 

M={g/U:geS 2 ,U€Tc-Jf}/~ 

where the equivalence relation is 

g/U ~ g'/U' if V(U'g - Ug') = 0 for some nonzero V e T c - J/. 

Show that this is an equivalence relation and that consequently the local 
module is an A-modulc under the rules 

g/U + g'/U' = (U'g + Ug')/(UU'), (T/U) ■ (g/U') = (Tg)/(UU'), 

again at the level of equivalence classes. Show that M ^ {0}, so by Nakayama's 
Lemma JM ^ M. Show that therefore J/S2 7^ £2- 

6.6 Eigenforms, Abelian varieties, and Modularity 

This section decomposes the Jacobian 3\(N) into a direct sum of complex tori 
associated to newforms, and similarly for Jo (AO- The resulting restatement of 
the Modularity Theorem associates a newform to an elliptic curve. 

Studying complex tori requires some preliminary results on tensor products 
of Abelian groups and fields. For the reader not familiar with this material, 
the following ad hoc definition of tensor product in this special case is all we 
need. 

Definition 6.6.1. Given a finitely generated Abelian group G and a field k, 
the tensor product G ® k is the vector space over k consisting of k-linear 
sums of elements g®k where jeG and k e k, subject to the relations 

g ® k + g' ® k = (g + g') <g> k, 
(g®k) + (g®k')=g®(k+ k'), 
k(g®k') = g® (kk 1 ). 

The fields of interest in this section are C and R. For these fields, and 
for any field of characteristic 0, if g is a group element of finite order n then 
g ® k — (ng) ® (k/n) — 0 ® (k/n) = 0 for any k € k, and so Zg ® k = 0. 
If g has infinite order then Zg ® k is the 1-dimcnsional vector space spanned 
by g (8) 1 with the usual field operations carried out in the second component. 
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For a direct sum of cyclic groups the tensor product is the corresponding- 
direct sum of vector spaces, 

(0G<) ®k = 0(G,®k). 

i i 

If G is a subgroup of some vector space over k then tensoring it with k is 
different from taking the k-lincar combinations of its elements. The tensor 
product G®k, where linear independence over Z is preserved over k, surjects 
to the vector space k • G, where linearly independent sets over Z can be 
dependent over k, the surjection taking elements ^gi®ki to X^ift- For 
example, in the proof of Theorem 6.5.4 the vector space V was H 1 (X 1 (iV), Z)(g> 
C and the proof made use of the surjection from V to 6>2(A(A0) A = C • 
Hi(Xi(iV),Z). Similarly, T z <g> C surjects to T c . 

Lemma 6.6.2. Let G be a finitely generated Abelian group and let k be a field 
of characteristic 0 . Then 

(a) G®k = k rank < G ). 

(b) (G/iT) ® k = (G ® k) /(if <g> k) for any subgroup K C G. 

(c) If A is a commutative ring with unit and G is an A-module then G <£> k is 
an A-module under the rule a(Xi 9i ® &i) = S»( ffl 5») ® ^/ ^ * s an *^ ea ^ 
of A then (JG)®k = J(G®k). 

Proof. Exercise 6.6.1. □ 

Now we can proceed with the material. Let / € S 2 (A(-^/)) be a newform 
at some level M/ and therefore an eigenform of the Hcckc algebra Tz- Recall 
the eigenvalue map 

A,:T Z — >C, Tf = X f (T)f 

and its kernel 

I f = ker(A / ) = {T e T z : Tf = 0}. 

As explained in the previous section, the map T i->- A/(T) induces a Z- 
modulc isomorphism from the quotient Tz/// to Z[{o n (/)}] and the latter has 
rank [K/ : Q]. Since Tz acts on the Jacobian Ji(M/), the subgroup I/Ji(M/) 
of Ji(M/) makes sense. 

Definition 6.6.3. T/ie Abelian variety associated to f is defined as the 
quotient 

Af = J 1 (Mf)/IfJ 1 (M f ). 

By this definition Tz/// acts on Af and hence so does its isomorphic 
image Z[{a„}]. Since A/(T p ) = a p (f) the following diagram commutes: 
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,h(M f )^^.h(M f ) 



(6.15) 



The bottom row here is not in general multiplication by a p (f) but rather a p (f) 
acting as T p at the level of Af, so for example if (p S Af and a : K/ C 
is an embedding then (a P (f)<f)(f a ) = a P (f)° V(/ CT ) by Theorem 6.5.4. The 
submodulc Z of Z[{a„}], being the isomorphic image of the submodule Z + If 
of Tx/If, does act as multiplication, so in particular if a p (f) lies in Z then 
indeed it acts as multiplication by itself. 

To describe how the Jacobian decomposes into Abelian varieties Af put 
an equivalence relation on newforms, 

/ <~ / / = f a for some automorphism a : C — > C. 

Let [/] denote the equivalence class of /, 

[/] = {f a : a is an automorphism of C}. 

Its cardinality is the number of embeddings a : Kj C, and each f a e [/] 
is a newform at level Mf by Theorem 6.5.4. The subspace of <S 2 (A(-^/)) 
corresponding to / and its equivalence class is 

V, = span([/])cS 2 (r 1 (M / )) ) 

a subspace of dimension [K/ : Q], the number of embeddings. Restricting the 
subgroup Hi(Xi(M/), Z) of <S 2 (Ti(-M/)) A to functions on Vf gives a subgroup 
of the dual space Vf , 

Af = R 1 (X 1 (M f ),Z)\ Vf . 
Restricting to Vf gives a well-defined homomorphism 

h(M f ) — ► Vf/Af, M i ^ + Ay for p e 5 2 (A(M / )) A . 

Proposition 6.6.4. Le£ / e 5 2 (A(A^/)) & e a newform with number field Kf. 
Then restricting to Vf induces an isomorphism 

A f ^ Vf/A f , [<p] + IfMMf) ^ <p\ Vj + Af for <p e 5 2 (A(M / )) A , 

and £/ie rigW side is a complex torus of dimension [K f : Q] . 

Proof. Let S 2 = S 2 {r l {M f )) and let B x = Hi(Xi(M/), Z) C 5^. The Abelian 
variety is 

Af = 3x{M f )/I f 3x{M f ) = {S£/Hi)/I f {S£lH{) 

= S£/(I f S£ + ifi) £* (^///^/(image of ^ in S£/I f S£). 
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But S§ /IfSQ — S2[If] A by the second isomorphism in (6.13), and in fact the 
isomorphism is the restriction map ip + IfSQ >-> w\s 2 [ij] f° r <P e ^2 i so now 

A f S 2 [I f ] A /H 1 \ S2[If] , M + IfJi(M f ) ^ <p\ Sa[If] + Hi| 52[J/] . 

We need to show that <S 2 [7/] = V/, and that Af = H\\v f is a lattice. 
Clearly Vj C <S 2 [7/]. To show equality, consider the pairing 

T C xS 2 ^C, (T,g)^ ai (Tg). 

This pairing is linear and nondcgcncratc in each component, a so-called per- 
fect pairing. Linearity in each component is clear. To show nondegeneracy in 
the first component, suppose T £ Tc and a\(Tg) = 0 for all g £ S 2 - Then also 
0 = ai (TT„ff) = cii(T n Tg) = a n (Tg) for all n £ Z+ and g £ S 2 , so Tg = 0 
for all g, and T = 0. Nondegeneracy in the second component is similar 
(Exercise 6.6.2(a)). Thus the map a\(Tg)) is a vector space iso- 

morphism <S 2 — T c . The map is also a Tz-module isomorphism since (T'T, g) 
and (T,T'g) both pair to a\(T'Tg) = ai(T(T'g)). Now (Exercise 6.6.2(b)) 

dim(5 2 [//]) - dim(5 2 [//] A ) = dim(S£/I f S£) = dim(T c ///T c ). (6.16) 

The surjection from Tz <8> C to Tc takes elements ^ Ui ® to ^ z,t/i. If 
each Ui £ If then since Zi = z(T\ £ Tc the image lies in //Tc- Thus the 
surjection descends to a surjection from (Tz <8> C)/(7/ ® C) to Tc/7/Tc, and 
so (Exercise 6.6.2(b) again) 

dim(5 2 [J/]) < dim((T z ® C)/^ ® C)) = dim((T z /J/) ® C) 

= rank(T z /i» = [K, : Q] = dim(V». ' 

The containment V/ C <S 2 [-f/] an d the relation dim(5 2 [//]) < dim(V/) combine 
to show that <S 2 [7/] = Vj as desired. 

To show that yl/ is a lattice in we need to establish that the R-span 
of Af is Vf~ and that rank(Af) < diniR,(V^). The containment S 2 3 V/ of 
vector spaces over R makes the restriction map of dual spaces 7r : <S^ — > 
a surjection. Since the R-span of Hi is S 2 , the R-span of Af — n(H\) is — 
7r(S^) as desired. Furthermore, taking dimensions over R (Exercise 6.6.2(b) 
again) , 

dim(F / A ) = dim(5 2 [//] A ) = dim(5 2 A /7/5 2 A ) 
= dim((If 1 ®R)/J / (ff 1 ®R)) 
= dim((77 1 ®R)/(7 / 77i®R)) 
= dim((iJi/7 / i7i) ® R) = rank(iJ 1 // / ffi). 

On the other hand, /ly = n(Hi) = Hi/ (Hi n ker(7r)), and 7/ acts on 77 1; 
taking it into ker(7r) since Vf = ^[7/], so 7/7fi C i7inkcr(-7r). This shows that 
there is a surjection Hi/If Hi — > Af, making rank(ylj) < rank(i7i/7/7fi) = 
dimR(V^ A ), and the proof is complete. □ 
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The useful equivalence relation between tori in our context is isogeny, as 
in Section 1.3. Extending the definition to more than one dimension, 

Definition 6.6.5. An isogeny is a holomorphic homomorphism between com- 
plex tori that surjects and has finite kernel. 

By Proposition 6.1.4, a holomorphic homomorphism between complex tori 
C 9 /A g and C h /A h takes the form 

ip(z + Ag) = Mz + A h , Me M M (C), MA g c A h . 

This is an isogeny when h = g and M is invertible (Exercise 6.6.3(a)), and 
now the one-dimensional argument in Chapter 1 that isogeny is an equivalence 
relation extends easily to dimension g (Exercise 6.6.3(b)). 

Theorem 6.6.6. The Jacobian associated to A(-W) is isogenous to a direct 
sum of Abelian varieties associated to equivalence classes of newforms, 

Here the sum is taken over a set of representatives f € <S2(-Ti(M/)) at levels 
Mf dividing N, and each nif is the number of divisors of N/Mf. 

Proof. By Theorem 5.8.3 and Theorem 6.5.4, S 2 (ri(N)) has basis 

Mm = UUU n™-) 

/ n a 

where the first union is taken over equivalence class representatives, the second 
over divisors of N/Mf, and the third over embeddings of K/ in C. (This relies 
on the full strength of Theorem 5.8.3. The part of that theorem that we have 
proved — that B 2 (N) spans — shows that a subset of B 2 (N) is a basis, and this 
is enough for the proof here except that the end of Theorem 6.6.6 is weakened 
to "each m/ is at most the number of divisors of N/Mf.") For each pair (/, n) 
let d = [Ky : Q], let a±, . . . , ad be the embeddings of Ky in C, and consider 
the map 

$ f>n : <S 2 (A(A0) A — ► V f A 
taking each <p e S 2 (r 1 (N)) A to ip £ where 

d d 

i=i j=i 

The map takes Hi(Xi(N),Z) into A f = Hi(Xi(M/), Z)\ Vf . To see this, let 
ip = J a for some loop a in Xi(N). Then correspondingly 

^(f a {r)) = n I f a (nT)dT = I f a (r)dT where a{t) = na(t). 
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Here the holomorphic differential u>(f a (nT)) on Xi(N) is identified with its 
pullback to H as in Chapter 3 and a is identified with some lift in H. Con- 
sequently a is the lift of a loop in X\(Mf) (Exercise 6.6.4), and <P/ )fl takes 
Hi(Xi(iV), Z) to A f as claimed. 

Taking the product map over all pairs (/, n) gives 

V = n^/n : S 2 (A(A0r 0 V} A = ©(V?) m '. 

/,»» /,n / 

This surjects since if </? e S2(r 1 (N)) A picks off a basis element /"(jjt), i.e., 
<f(f a (nT)) = 1 and </? = 0 on the rest of the basis, then its image ip picks 
off the basis element / CT (r) in the copy of corresponding to n. Counting 
dimensions shows that 9 is a vector space isomorphism. It descends to an 
isomorphism of quotients 

W : J 1 (N) ^ 0(F / A ) ro V^(H 1 (X 1 (7V), Z)). 



Since "^(Hi (Xi ( Af) , Z)) C Q)jA™ f is a containment of Abelian groups of the 
same rank, the natural surjection 

n ; 0(l/A )TO/MHi(Xi(7V)jZ)) _^ 0 ( yA M/r/ 
/ / 

has finite kernel. By Proposition 6.6.4 there is an isomorphism 

,:0(T//M / r^0^, 
/ / 

and so i o tt o & : Ji(JV) — > 0 A™' is the desired isogeny. □ 

For future reference in Chapter 8 we make some observations in connection 
with Theorem 6.6.6 and its proof. The following diagram commutes, where p 
is a prime not dividing N, the vertical arrows are the isogeny of the theorem, 
and the sum on the bottom row is the sum in the theorem notated slightly 
differently (Exercise 6.6.5(a)): 

h(N) >3 1 (N) 

(6.19) 

Q Af " ; 0.1,. 

Since isogeny is an equivalence relation there exists an isogeny from Af 

back to Ji(iV). The following diagram commutes as well, where the vertical 
arrows denote this isogeny (Exercise 6.6.5(b)): 
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0A, 0.1, 

(6.20) 

Ji(JV) >Ji(N). 



We end this chapter with a version of the Modularity Theorem involving 
Abelian varieties. Since the group r 0 (N) contains Ji(iV), its associated ob- 
jects S 2 (/o(A0), X 0 (N), and Jo(-W) are respectively a subspace, an image, 
and a quotient of their counterparts for ri(N), smaller objects in all cases. 
Our practice is to develop results in the larger context of J\ (N) for general- 
ity but to state versions of the Modularity Theorem in the restricted context 
of ro(N), making them slightly sharper. 

In particular, for each newform / € S 2 (r 0 (Mf)) let 

A) = Jo(M / )/7 / J 0 (M / ). 

This is another Abelian variety associated to /. The proofs of Proposition 6.6.4 
and Theorem 6.6.6 transfer to Jo(iV), showing that there is an isomorphism 

A' f V f A I A) where A) = Hi (X 0 ( Af» , Z) | Vf 

and an isogeny 

Jo(JV)->0W" 
/ 

where now the sum is taken over the equivalence classes of newforms f G 

s 2 (r Q (M f )). 

In fact Af and A'^ are isogenous. By Proposition 6.2.4, Af is a finite-index 
subgroup of /lp and thus there is a surjection from Af to A'^, 

V f A /A f — > Vf/A'f, V + A f ^Lp + A f , 

whose kernel A'^/Af is finite, showing that there is an isogeny — >• A^. The 
purpose of introducing A'^ is to phrase the following version of the Modularity 
Theorem entirely in terms of Jo (AT). By incorporating Abelian varieties this 
version associates a modular form, in fact a newform, to an elliptic curve. 
The distinction between using A'f or Af in the theorem for a newform / is 
only cosmetic; what matters is that / itself lies in the smaller space associated 
to r 0 (N) rather than the larger one associated to Ji(iV), making the theorem 
more specific. 

Theorem 6.6.7 (Modularity Theorem, Version Ac)- Let E be a complex 
elliptic curve with j(E) € Q. Then for some positive integer N and some 
newform f <G S2(r 0 (N)) there exists a surjective holomorphic homomorphism 
of complex tori 

A'f E. 
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Versions Jc and Ac are equivalent by the isogeny Jo(^V) — ► ^[7^(^/) m/ 
since each restriction A'j — > E of a map ^^(^4/) m/ — > E is trivial or 

surjects. Note that the integers N appearing in Versions Jc and Ac are not 
necessarily the same: if Version Jc holds for N then Version Ac holds for 
some N' | N, while if Version Ac holds for TV then so does Version Jc- Later 
versions of the Modularity Theorem will specify N precisely. 

The versions of the Modularity Theorem so far are designated as vari- 
ants of Type C because they all involve holomorphic maps of compact Rie- 
mann surfaces and complex tori. The versions in the next chapter will move 
from complex analysis to algebraic geometry and change the field of definition 
from C to Q. 

Exercises 

6.6.1. Prove Lemma 6.6.2. 

6.6.2. (a) In the proof of Proposition 6.6.4, prove that the pairing Tc x5 2 — > 
C is nondegenerate in the second component. 

(b) Justify the equalities in (6.16), (6.17), and (6.18). 

6.6.3. (a) Show that a map ip(z + A g ) = Mz + A h where M e M h , g (C) and 
MA g C Ah is an isogeny when h = g and M is invertible. 

(b) Show that isogeny is an equivalence relation. 

6.6.4. (a) For any 7 S A(-^V) and any positive integer n \ N/Mf, show that 

ISSlTlSSr'eAtM;). 

(b) Suppose that the path a : [0, 1] — > H is the lift of a loop in Xi(N). 
Show that the path a(t) = na(t) is the lift of a loop in Xi(Mf). 

6.6.5. (a) Show that diagram (6.19) commutes. (Hints for this exercise are at 
the end of the book.) 

(b) Show that diagram (6.20) commutes. 
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Modular Curves as Algebraic Curves 



Recall that every complex elliptic curve is described as an algebraic curve, 
the solution set of a polynomial equation in two variables, via the Weierstrass 
p- function, 

(p, p') : C/A — ► {{x, y):y 2 = 4x 3 - g 2 (A)x - g 3 (A)} U {c»}. 
Let N be a positive integer. The modular curves 

x 0 (n) = r 0 (N)\u*, Xi(jv) = x(jv) = r(N)\n* 

can also be described as algebraic curves, solution sets of systems of polyno- 
mial equations in many variables. Since modular curves are compact Riemann 
surfaces, such polynomials with complex coefficients exist by a general theo- 
rem of Riemann surface theory, but Xq(N) and X\(N) are in fact curves over 
the rational numbers, meaning the polynomials can be taken to have rational 
coefficients. The Modularity Theorem in its various guises rephrases with the 
relevant complex analytic objects replaced by their algebraic counterparts and 
with the relevant complex analytic mappings replaced by rational maps, i.e., 
maps defined by polynomials with rational coefficients. The methods of this 
chapter also show that X(N) is defined by polynomials with coefficients in 
the field Q(fi N ) obtained by adjoining the complex Nth roots of unity to the 
rational numbers. 

The process of defining modular curves algebraically is less direct than 
writing a polynomial explicitly via the Weierstrass p-function as we can do 
for elliptic curves; it works with fields of functions on the curves rather than 
with the curves themselves. According to the moduli space point of view from 
Section 1.5, a modular curve is a set of equivalence classes of elliptic curves 
and associated torsion data, and that description will emerge again naturally 
during this chapter. Thus the moduli space description of modular curves 
carries over to the algebraic context. The descriptions of the Hecke operators 
in terms of modular curves and moduli spaces rephrase algebraically as well. 
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Related reading: The main topics of this chapter are in Chapters 4 and 6 
of [Shi73]. Chapter 6 of [Lan73] also overlaps with this chapter's material on 
modular function fields. The presentation here is drawn from David Rohrlich's 
article in the proceedings volume from the 1995 Boston University conference 
on modular forms and Format's Last Theorem [CSS97] and from early parts 
of Joseph Silverman's book on elliptic curves [Sil86] . Other books on elliptic 
curves are [ST92], [Was03], [Kob93], [Kna93], [Hus04], and [Sil94], the first 
two especially inviting for beginners, the third emphasizing the connections 
between elliptic curves and modular forms, and the fourth discussing much of 
the material here. The notes at the end of [Kna93] give an overview of sources 
up through 1992 for much of the material in Chapters 7 through 9 of this 
book. For algebraic curves in general, [Ful69] is an excellent introduction. 

7.1 Elliptic curves as algebraic curves 

Seeing how the description of a modular curve as an algebraic curve reflects 
its description as a moduli space — essentially a family of elliptic curves and 
torsion data — first requires understanding elliptic curves as algebraic curves. 
Up to now we have worked mostly over the field C, but this chapter will use 
function fields both over C and over Q to discuss algebraic curves, so let k 
denote any field of characteristic 0. Fields of arbitrary characteristic will be 
considered in the next chapter. 

Definition 7.1.1. An element a of an extension field K of k is algebraic 
over k if it satisfies some monic polynomial with coefficients in k. Otherwise 
a is transcendental over k. A field extension K/k is algebraic if every 
a G K is algebraic over k. A field k is algebraically closed if there is no 
proper algebraic extension K/k; that is, in any extension field K of k, any 
element a that is algebraic over k in fact lies in k. An algebraic closure k 
of k is a minimal algebraically closed extension field o/k. 

Every field has an algebraic closure. Any two algebraic closures k and k 
of k are k-isomorphic, meaning there exists an isomorphism k — ^ k fixing 
k pointwise, so we often refer imprecisely to "the" algebraic closure of k. In 
Chapter 6 the algebraic closure Q was unique because it was set inside the 
fixed ambient field C. 

A Weierstrass equation over k is any cubic equation of the form 

E : y 2 = 4x 3 -g 2 x-g 3 , <?2,ff 3 ek. (7.1) 
As in Section 1.1, define the discriminant of the equation to be 

A = g\- 21 g\ e k, 
and if A ^ 0 define the invariant of the equation to be 



j = 1728g 3 2 /A e k. 
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Definition 7.1.2. Let k be an algebraic closure of the fieldk. When a Weier- 
strass equation E has nonzero discriminant A it is called nonsingular and 
the set 

2 

£ = {(x, y) e k satisfying E(x, y)}U {oo} 
is called an elliptic curve over k. 

Thus an elliptic curve always contains the point oo. As the solution set of 
a polynomial equation in two variables, an elliptic curve as defined here is a 
special case of a plane algebraic curve. The term "algebraic curve" is being 
used casually so far, but it will be defined in the next section after some of 
the ideas are first demonstrated here in the specific context of elliptic curves. 

Strictly speaking we are interested in equivalence classes of elliptic curves 
under admissible changes of variable, 

x = u 2 x', y = u 3 y', iifk*. 

These transform Weierstrass equations to Weierstrass equations, taking g 2 
to g 2 /u 4 , 53 to 53/u 6 , the discriminant A to A/u 12 (again nonzero) and pre- 
serving the invariant j (Exercise 7.1.1). Admissible changes of variable are 
special cases of isomorphisms between algebraic curves, a topic to be dis- 
cussed more generally in the next section. 

For example, consider an elliptic curve £ over C whose invariant is rational. 
Supposing 52 and g 3 arc nonzero, the condition j e Q is g\ = rg\ for some 
nonzero re Q (Exercise 7.1.2(a)). Let u € C satisfy g 2 /u 4 = r; then also 
r 3 u 12 = g\ = rg 2 and thus c/3/u 6 = ±r, and we may take g 3 /u e = r after 
replacing u by iu if necessary. Since the admissible change of variable x — u 2 x' , 
y = u 3 y' produces new Weierstrass coefficients g 2 = g 2 /u 4 and g' 3 = 53 /w 6 , 
this shows that up to isomorphism £ is defined over Q by a Weierstrass 
equation 

y 2 = 4x 3 - gx - g, g G Q. 

A separate argument when one of g 2 , <?3 is zero (Exercise 7.1.2(b)) shows that 
in all cases an elliptic curve over C has rational invariant j € Q if and only if 
it is isomorphic over C to an elliptic curve over Q. This argument works with 
C replaced by any algebraically closed field k of characteristic 0 and with Q 
replaced by any subfield f of k. It shows that any two elliptic curves over k 
with the same invariant j are isomorphic when k is algebraically closed. 

Associated to each Weierstrass equation and also denoted E is a corre- 
sponding Weierstrass polynomial, 

E(x, y) = y 2 - Ax 3 + g 2 x + g 3 e k[x, y]. 

The Weierstrass equation (7.1) is nonsingular, meaning as in Definition 7.1.2 
that its discriminant A is nonzero, if and only if the corresponding curve £ is 
geometrically nonsingular, meaning that at each point (x,y) € £ at least one 
of the partial derivatives D\E(x,y), D 2 E(x,y) of the Weierstrass polynomial 
is nonzero. To see this, note that the Weierstrass polynomial takes the form 
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E(x,y) = y 1 -4(.t 



xi){x - x 2 )(x 



xi,x 2 ,x 3 e k. 



The third condition of 



E(x,y)=0, 



D 1 E(x,y) = 0, 



D 2 E(x,y) = 0 



is y = 0 since char(k) = 0. Now the first condition is x £ {xi, x 2l X3}, making 
the second condition impossible exactly when X\, x 2 , x 3 are distinct, i.e., when 
A 7^ 0. One can think of geometric nonsingularity as meaning that £ has a 
tangent line at each point. 

Many good texts on elliptic curves exist, cf. the references at the begin- 
ning of the chapter, so we state without proof the facts that we need. Most 
importantly, 

every elliptic curve forms an Abelian group with the point 00 as its 
additive identity. 

The point 00 is therefore denoted Of from now on. One can think of addition 

on £ geometrically, algebraically, and analytically. 

Geometrically the elliptic curve really sits in the projective plane over k, 

2 

denoted P 2 (k), the usual plane k (called the affine plane) along with some 

additional points conceptually out at infinity added to complete it. (The reader 
who is unfamiliar with this construction should work Exercise 7.1.3 for details 
of the following assertions.) The projective plane is the union of three overlap- 
ping affine planes and the elliptic curve is correspondingly the union of three 
affine pieces. The projective point 00 of £ is infinitely far in the y-direction, 
i.e., Of — [0, 1, 0] in projective notation. The curve can be studied about Of by 
working in a different affine piece of P 2 (k), and it is geometrically nonsingular 
at Of as it is at its finite points. Projective space is a natural construct, e.g., 
the Riemann sphere CU {00} is P 1 (C), and similarly the set {0, . . . ,p— 1, 00} 
that often serves as an index set in the context of the Hecke operator T p can 
be viewed as P^Z/pZ). 

Bezout's Theorem says that if C\ and C 2 are plane curves over k whose 
defining polynomials have degrees d\ and d 2 and are relatively prime in k[x, y] 
(a unique factorization domain) then their intersection in P 2 (k) consists of 
d\d 2 points, suitably counting multiplicity. Thus cubic curves, and only cubic 
curves, naturally produce triples P, Q, R of collinear points, meaning projec- 
tive points satisfying an equation ax + by + cz = 0 where a, b, c € k are not all 
zero. If the line is tangent to the curve then two or three of P, Q, and R will 
coincide. The addition law on elliptic curves £ is that collinear triples sum to 
0 £ . That is, 



We saw this idea back in Section 1.4. In particular, since P — P + 0g = 0g it 
follows that P, —P, and Of are collinear. So far this only requires Of to be any 
point of £, but the condition Of = [0, 1,0] gives the addition law a pleasing 



P + Q + R = Of P, Q, R arc collinear. 
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geometry. Since is infinitely far in the vertical direction, P and —P have 
the same ^-coordinate. At most two points with the same ^-coordinate satisfy 
the Weierstrass equation (7.1), so any two points with the same x-coordinate 
are equal or opposite, possibly both. Since the y- values satisfying (7.1) for a 
given x sum to 0 the additive inverse of P — (xp, yp) is the natural companion 
point 

-P= (x P ,-y P ). 

As remarked, this could well be P again. More generally, given points P and Q 
of £, let R be their third collinear point. The addition law P + Q = —R says 
that the sum is the companion point of R, its reflection through the ir-axis, 

if P, Q, R are collinear then P + Q = (xp, —yp). (7.2) 

Figure 7.1 illustrates this for the elliptic curve y 2 = 4a; 3 — Ax. 




Moving from geometry to algebra, the group law is defined by rational 
functions over the field k. Let P = (xp,yp) and Q = {xq^q) be nonzero 
points of the curve, and suppose their sum P+Q = (xp + Q,yp + Q) is nonzero 
as well. Then 

xp+q = r(xp,y P ,XQ,y Q ) and y P+Q = s(xp,y P ,XQ,y Q ) 

where r and s are rational functions with coefficients in k. Even more specifi- 
cally since char(k) = 0, r and s are rational functions over the field Q(<72,<?3)- 
If xq = x P and yq = —y P then Q = —P and so P + Q = O5. Otherwise P + Q 
lies in the affine part of £. Let 

( x Q y P - x P y Q 



A 



VQ ~ VP 

Xq - X P 

Ylxp 



92 



2^ 



xp ^ x Q , 



XP = XQ, 



-Ax\ 



- x P 

- 92Xp 
2y P 



xp ^ x Q , 



xp = Xq. 
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The line y — Xx + \i passes through P and Q when P ^ Q and is the tangent 
line to £ at P when P = Q (Exercise 7.1.4(a)). The casewise nature of A 
and n will be discussed further in the next section. In either case, the rational 
functions giving xp + q and yp+Q are 

r(xp,y P ,x Q ,y Q ) = \ 2 /4- x P - x Q , 
s(x P ,yp,XQ,y Q ) = -\r{xp,y P ,XQ,y Q ) - /i. 

This algebraic definition of addition corresponds to the geometric descrip- 
tion (7.2) (Exercise 7.1.4(b)). For example, on the curve y 2 = 4x 3 — Ax of 
Figure 7.1, (0,0) + (2,2\/6) = (-1/2, V&/2). 

Because the coordinates of Of = [0,1,0] lie in k and because the group 
law is rational over k, for any algebraic extension K/k (i.e., k C K C k) the 
set of K.-points of £ is a subgroup of £, 

£(K) = {Pe£- {0 £ } : (x P ,yp) E K 2 } U {0 £ }. 

In particular if £ is an elliptic curve over Q then its affine rational points and 
0 £ form a group under the addition law. A special case of the Mordell-Weil 
Theorem states that this group is finitely generated. [ST92] is an appealing 
text on this subject. 

As explained earlier, to understand modular curves we also want to study 
the torsion structure of £ algebraically. For any positive integer N let 

[N] : £ — ► £ 

denote AT-fold addition, e.g., [2]P = P + P. This takes the form of a rational 
function, 

\^N(x,y) 2 ^ N (x,y) 3 J 

Here the Nth division polynomials 4>n, ^n, lie in Z[§2, 33, x, y], and (oo, oo) 
is understood to mean 0 £ . Thus the condition [iV](a;, y) = 0g is iPn( x , y) = 0. 
This works out to a polynomial condition on x alone, 

[N](x,y)=0 £ >^=> ■4) N (x) = 0, %() N e Z [g 2 ,93,x] ■ 

For instance, the condition [2]P = 0 for a nonzero point P is yp = 0, or 
4xp — g 2 xp — (73 = 0. Similarly, the condition that [3]P = 0 for nonzero P is 
[2]P = —P, or x([2]P) = x P , or r(xp 7 yp,x P ,y P ) = x P , and this works out 
to 48x P — 2Ag 2 x 2 p — A8g 3 x P — g\ = 0 (Exercise 7.1.5(a)). The polynomial 
will be used in Section 7.5. 

Let £[N] denote the group of N -torsion points of £, the kernel of [N], 

£[N] = {Pe£: [N]P = 0 £ }. 

We quote the structure theorem for £[N]. 
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Theorem 7.1.3. Let £ be an elliptic curve over a field k of characteristic 0 
and let N be a positive integer. Then £[N] = (Z/NZ) 2 . 

Let K be a Galois extension field of k containing the x- and y-coordinates 
of £[N] - {Of;}. The relations E(x a ,y <J ) = E(x,yY and ■4> N (x' 7 ) = ■4> N (x) a 
for any x,y G K and any automorphism a G Gal(K/k) show that the Galois 
group acts on £[N]. The action is an automorphism since the coefficients 
of the rational functions r and s lie in Q(<?2,</3) and hence in k. That is, 
r(x P ,y P ,x^,y^) = r(xp,yp,XQ,y Q )' 7 for all P, Q e £[N], and similarly for s, 
making P a + Q a = (P + Q) a . Since £[N] ^ (Z/iVZ) 2 , once an ordered basis 
(P, Q) of £[N] over Z/NZ is chosen this gives a representation 

p : Gal(K/k) — ► GL 2 (Z/iVZ), 

where as always GL2(Z/A^Z) is the group of invertible 2-by-2 matrices with 
entries in Z/NZ. We will return to this idea later in the chapter and again 
more extensively in Chapter 9. 

Analytically, when k C C we may view the coefficients of the Weierstrass 
equation as complex numbers. As shown in Proposition 1.4.3, there exists 
a lattice A c C such that the Weierstrass equation (7.1) takes the form 
y 2 = 4x 3 — g2(A)x — g$ (A), and addition on the corresponding curve £ is 
compatible with the natural addition on C/A via the Weierstrass function p. 
In particular, Theorem 7.1.3 here is clear and familiar when k = C. Also as 
shown in Section 1.4, holomorphic isomorphisms of complex tori correspond 
to admissible changes of variable in complex Weierstrass equations. 

Exercises 

7.1.1. Show that every admissible change of variable x = u 2 x' , y — u 3 y' 
where nek* transforms a Weierstrass equation E into another Weierstrass 
equation E' with 

u 4 g' 2 = g 2 , u e g' 3 = g 3 , u 12 A' = A, f = j. 

7.1.2. (a) Confirm that for a Weierstrass equation (7.1) with g 2 and gz nonzero 
the condition j e Q is equivalent to the condition g 2 = rg 2 for some nonzero 
reQ. 

(b) Show that an elliptic curve over C with either of 52 , 33 zero is isomor- 
phic to an elliptic curve over Q. 

7.1.3. For any positive integer n and any field K, n- dimensional projective 
space over K is the set of equivalence classes of nonzero (n+ l)-tuples modulo 
scalar multiplication, 



po 

Q° 



P"(K) = (K" +1 -{0})/- 
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where v ~ v' if v' = cv for some nonzero c € K. Let [v] € P"(K) denote the 
equivalence class of the vector v. 

(a) When n = 1 this construction gives the projective line. Show that 

P\K) = {[x,l] :xeK}U{[l,y] : y G K}, 

so the projective line is an overlapping union of two affine lines. Show also 
that 

P 1 (K) = {[x,l] :ieK}U{[l,0]}, 

so the projective line is a disjoint union of the line and a point at infinity. In 
particular, P 1 (C) is the Ricmann sphere C. 

(b) When n = 2 the construction gives the projective plane mentioned in 
the section. Show that 

P 2 (K) = {[x, y, 1] : x, y e K} U {[x, 1, z] : x, z e K} U {[l,y, z]:y,z£ K}, 

so the projective plane is a union of three affinc planes. Show also that 

P 2 (K) = fe!/,l]:i,!/eK}U{[x,l I 0]:xGK}U{[l,0,0]} 
= {[x,y,l]:x,y€K}U(P 1 (K)x{0}), 

so the projective plane is a disjoint union of the plane and a projective line 
at infinity. 

(c) Homogenize the Weierstrass polynomial by adding in powers of z to 
make each term cubic, 

Ehom(x, y, z) = y 2 z - 4a; 3 + g 2 xz 2 + g 3 z 3 . 

Show that either all points or no points in each equivalence class [x, y, z] <G 
P 2 (k) satisfy -Ehom- Show that [0,1,0] satisfies -Ehom and no other [x,y, 0] 
does. 

(d) Dehomogenize -Ehom by setting y = 1 to obtain a second affine version 
of E, 

E'(x, z) = z — 4x 3 + g 2 xz 2 + g 3 z 3 . 

In the (x, z) coordinate system, the infinite point 0^ is (0, 0). Working in this 
affinc coordinate system, show that £ is geometrically nonsingular at Of. Is 

2 

the set of points (x, z) £ k satisfying E'(x, z) all of £? 

(e) Let P = (xp,yp) be any (a;, y)-point of £, and homogenize it to P = 
[xp, Up, 1]- Show that the homogeneous equation x — xpz = 0 is satisfied by P 
and by 0^, thus defining the projective line containing them. Dehomogenize 
back to an (x, y)-equation to obtain a vertical line. Thus 0^ is infinitely far in 
the vertical direction. 

7.1.4. (a) Show that the casewise definitions of A and fi make the line y = 
\x + n the secant line through P and Q when P ^ Q and the tangent line 
to £ through P when P = Q. 
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(b) Show that the algebraic definition (7.3) of elliptic curve addition cor- 
responds to the geometric description (7.2). (A hint for this exercise is at the 
end of the book.) 

(c) The points (2, ±5) satisfy the equation y 2 = 4x 3 — 7. Find another 
point (x,y) G Q 2 that does so as well. 

7.1.5. (a) Confirm that the condition x([2]P) = xp works out to the polyno- 
mial condition given in the section. 

(b) Compute £ [2] and £ [3] for the elliptic curve with Weierstrass equation 
y 2 = 4x 3 - Ax. 

7.2 Algebraic curves and their function fields 

Describing modular curves as algebraic curves without writing specific poly- 
nomial equations requires some general theory. The idea is that curves are 
characterized by fields, and so to construct curves it suffices to construct ap- 
propriate fields instead. Similarly, mappings between curves are characterized 
by field extensions, and so to construct mappings it suffices to construct ap- 
propriate extensions. This section sketches some background and then states 
the theorems. 

Let k be any field of characteristic 0. Let to, n be positive integers. Consider 
a set of m polynomials over k in n variables, ip± , ... , ip m G k[xi, . . . , x n ] . Let / 
be the ideal generated by the ipi in the ring of polynomials over the algebraic 
closure of k, 

J = fa, tp m ) c k[xi, . . . ,x n ], 

and let C be the set of simultaneous solutions of the polynomials, or equiva- 
lcntly the simultaneous solutions of all polynomials in the ideal, 

C = {P G k™ : (f(P) = 0 for all (f G I}. 

Suppose that / is a prime ideal of k[xi, . . . , x n \. The coordinate ring of C 
over k is the integral domain 

k[c] = k>!,..., *„]//. 

An element of the coordinate ring is called a polynomial function on C . The 
function field of C over k is the quotient field of the coordinate ring, 

k(C) = {F =f/g:f, g G k[C], g + 0}. 

The function field really consists of equivalence classes F of ordered pairs 
(/,<?) G k[C] 2 where the equivalence is determined by the usual rule for frac- 
tions, (/, g) <~ (f',g') if fg' = f'g (primes do not denote differentiation here 
or elsewhere in the chapter), but the notation F = f/g is less cumbersome. 
An element of the function field is called a rational function on C. 
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Definition 7.2.1. Ifk(C) is a finite extension of a field k(t) where t is tran- 
scendental over k then C is an affine algebraic curve over k. If also for 
each point P e C the m-by-n derivative matrix [Djipi(P)] has rank n—1 then 
the curve C is nonsingular. 

The definition shows that if C is an affine algebraic curve over k then 
m > n — i.e., the number of constraints on the variables is at least the 
geometrically obvious minimal number for a curve. After we introduce more 
ideas it will be an exercise to show that the nonsingularity condition is a 
maximality condition on the rank of the derivative matrix, i.e., the rank cannot 
be n. 

Homogenize the polynomials ip t with another variable x 0 , consider the 
ideal 

horn = (<P*,hom) C k[x 0 , ...,X n ] 

generated by the homogeneous polynomials but also containing inhomoge- 
neous ones (unless 4om = {0}), and consider the set 

Chom = {P € P"(k) : (fi(P) = 0 for all homogeneous ip G Ihom} • 

This is the projective version of the curve, a superset of C, the union of 
the affine curves obtained by dehomogenizing the i^i.hom at each coordinate, 
cf. Exercise 7.1.3(d). The definition of nonsingularity extends to Chom by 
dehomogenizing to appropriate coordinate systems at the points outside of k . 
In this chapter we are interested in nonsingular projective algebraic curves, but 
for convenience we usually work in the (xi, . . . , x n ) affine coordinate system, 
illustrating some projective ideas in examples and exercises. 

Setting to = 0 and n — 1 shows that the projective line P : (k) is a nonsin- 
gular algebraic: curve over k with function field k(x). When rn = 1 and n = 2 
and the single polynomial defining C is a Weierstrass polynomial E(x, y) then 
C is an elliptic curve £ as in Section 7.1. The function field is k(x,y) where 
x and y are related by a Weierstrass equation, or k(x)[y]/ (E(x,y)} , showing 
that it is quadratic over k(x) . Thus elliptic curves over k are indeed nonsingu- 
lar projective algebraic curves over k as in Definition 7.2.1 and the preceding 
paragraph. The projective line can also be placed in the m = 1, n = 2 context 
by taking ip(x, y) = y. At the end of the section we will see why the definition 
of algebraic curve has not been limited to to = 1 and n = 2, i.e., to plane 
curves. 

Let C be a nonsingular algebraic curve over k. Since C is defined by the 
condition ip(P) = 0 for all ip e I, an clement / + / of the coordinate ring takes 
a well defined value in k at each point P e C even though its representative / 
is not unique. That is, a polynomial function on C, a formal algebraic object, 
also defines a mapping from the set C to k. A priori it is not clear that an 
element F = f/g of the function field takes a well defined value in k U {oo} 
at each P e C since f(P)/g(P) might evaluate to 0/0, but we will see that 
it does and therefore a rational function F € k(C) defines a mapping from C 
to P x (k). In fact it defines a mapping on all of Chom- 
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Let P = (pi, . . . ,p n ) be a point of C. To study rational functions on C 
at P start with the elements of the coordinate ring that vanish at P, 

m P = {/ G k[C] : /(P) = 0} = {{ Xl - Pi + 1 : 1 < i < n}). 

This is a maximal ideal of k[C] (Exercise 7.2.2), and its second power is 

mp = ({(xi - Pi)(xj - pj) + 1 : 1 < i,j < n}). 

Lemma 7.2.2. Let C be a nonsingular algebraic curve. For any point P € C , 
the quotient mp/mp is a 1- dimensional vector space overk. 

Proof. The tangent line to C at P is 

T P (C) = {vek n : [D m (P)]v = 0}, 

a 1-dimcnsional vector space over k by the Rank-Nullity Theorem of linear 
algebra since C is nonsingular. 

Although a function f + I E k[C] has a well defined value at P, its form 
as a coset makes its gradient (vector of partial derivatives) at P ill defined. 
But changing coset representatives changes the gradient to 

V(/ + ^)(P) = V/(P) + V^(P), pel, 

and since Vtp(P) is in the row space of [Djipi(P)] (Exercise 7.2.3(a)) the 
definition of the tangent line shows that the inner product of the gradient 
V/(P) with any v e Tp(C) is well defined. Translating / by a constant has 
no effect on the inner product, and any function / G mp satisfies Vf(P)-v = 0. 
Thus there is a well defined pairing 

mp/mp x T P {C) — > k, (/, v) H- V/(P) • v. 

This pairing is perfect, meaning that it is linear and nondegencrate in each 
component, cf. the proof of Proposition 6.6.4 (Exercise 7.2.3(b)). It follows 
that mp/mp and T. p {C) have the same dimension as vector spaces over k 
since each can be viewed as the dual space of the other. Since the tangent line 
T p (C) has dimension 1, so does mp/mp. □ 

For example, let £ be an elliptic curve defined by a Weierstrass polyno- 
mial E. The proof of Lemma 7.2.2 shows that x — x P + mp spans mp/mp if 
D 2 E(P) 0, meaning the tangent line at P is not vertical, and y — yp + mp 
spans mp/mp if D\E{P) ^ 0, meaning the tangent line is not horizontal 
(Exercise 7.2.3(c)). 

The local ring of C over k at P is 

k[C]p = {f/9 e k(C) : g(P) ? 0}, 



a subring of the function field. The local maximal ideal at P is the ideal of 
functions that vanish at the point, 
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M P = mpk[C] P = {f/g e k[C] P : f(P) = 0}. 

This is a maximal ideal of k[C]p, and since it consists of all the noninvertible 
elements it is the unique maximal ideal (this is not true for mp and k[C]). 
Consider the natural map from the maximal ideal to the quotient of its local- 
ization, 

i:mp — > Mp/Mp, / / + Mp. 

The kernel is mp n Mp = mp (Exercise 7.2.3(d)). The map surjects since for 
any f/g e M P the subtraction f/g(P) - f/g = f(g - g(P))/(g(P)g) e M 2 
shows that i(f/g(P)) = f/g + Mp. Thus i induces an isomorphism 

i : mp/mp M P /Mp. 

Proposition 7.2.3. For any nonsingular point P € C the ideal Mp is prin- 
cipal. 

Proof. By the lemma and the isomorphism i, some function t € k[C] generates 
Mp/Mp as a vector space over k. This function will also generate Mp as 
an ideal of k[C]p. Letting N — ik[C]p, an ideal of Mp, it suffices to show 
that the quotient M P /N is trivial. Since M P is a k[C]p-module, so is the 
quotient. Noting that M P ■ (M P /N) = (N + Mp)/N and that N + Mp = 
(kt + Mp)k[C] P = M P shows that M P ■ (M P /N) = M P /N. By Nakayama's 
Lemma (Proposition 6.5.5), Mp/N — 0 as desired. □ 

The proposition requires the local ring to apply Nakayama's Lemma. The 
results so far in this section allow an argument (Exercise 7.2.4) that the con- 
dition for a curve to be nonsingular is a maximality condition on the rank of 
the derivative matrix, as stated after Definition 7.2.1. 

A generator t of Mp is called a uniformizer at P. Any t € mp — mp is 
a uniformizer but t need not generate mp. For example, let £ be the elliptic 
curve over Q with Weierstrass equation E : y 2 = 4x 3 — Ax. The remark after 
the proof of Lemma 7.2.2 shows that y is a uniformizer at the point P = (0, 0) . 
Indeed the Weierstrass relation x = y 2 /{Ax 2 — 4) shows that x is redundant as 
a generator of Mp in k[£]p. On the other hand, x is needed along with y to 
generate mp in k[£]. To see this, note that Q[£]/(y) = Q[x]/(x 3 —x) and this 
is not a field, showing that (y) is not maximal in Q[£] and thus is not mp. 

Let F be any nonzero element of k[C]p and let t be a uniformizer at P. 
Either F is a unit or it takes the form F = tF\ where F\ is a nonzero clement 
of k[C]p. Either Fx is a unit or it takes the form F\ = tF 2 , and so on. If 
this process doesn't terminate then it leads to an ascending chain of ideals 
in k[C] P , 

(F) C (F^ C (F 2 ) C • • • , 

where each containment is proper. But k[C]p is Noetherian (see Exer- 
cise 7.2.5), so this cannot happen. Therefore F takes the form F = t e u where 
eeN and u € k[C] P . This representation of F is unique, for if F = t e u = t e u' 
with e > e' then t e ~ e u = u' , showing that e = e' and u — is! . 
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The valuation at P on the coordinate ring is the function 

. P :k[CHNU{H M.f) = 

This extends to the function field, 

up : k(C) -^ZU {+cx)}, vp{F) = - i/pfo), F = //<?, 

where it is well defined and it satisfies (Exercise 7.2.6) 

Proposition 7.2.4. Let C be a nonsingular algebraic curve. For any point 
P e C the valuation vp : k(C) — > Z U {+00} surjects, and for all nonzero 
F,G € k(C) it satisfies the conditions 

(1) u P (FG) = v P {F) + v P (G), 

(2) v P {F + G) > mm{v P (F),i> P (G)} with equality if v P {F) ^ v P {G). 

Now we can describe the mapping from C to P : (k) defined by a rational 
function F <E k(C). The zero function defines the zero mapping. Any nonzero 
function takes the form 

F = t Vpl - F ^-, v P (F) e Z, f,g£ k[C], f(P) and nonzero 

at each point PeC. The mapping it defines is 

!0 if v P (F) > 0, 

00 if u P {F) < 0, 

f(P)/g(P) HMF) = 0- 

The valuation is the algebraic analog of the order of vanishing for meromorphic 
functions at a point P of a Riemann surface, and so one speaks of the zeros 
and poles of a rational function in the usual way. 

Since these methods are local they work on all of Chom, using an appro- 
priate coordinate system at each point. It turns out that dehomogenizing the 
polynomials defining a nonsingular projective curve Chom to define one of its 
affine pieces leads to the same function field up to isomorphism whenever 
the affine piece is nonempty, regardless of which variable is set to 1 , and the 
isomorphism between two affine function fields associated to Chom preserves 
valuations on the overlap of the two corresponding affine pieces of the curve. 
We will touch on this point, but not prove it completely, in Section 8.5. Also, 
the valuations on k(C), meaning the surjections v : k(C) — > Z U {+00} 
satisfying the conditions of Proposition 7.2.4, are precisely the valuations v P 
arising from the points of the projective curve Ch om (for the proof see Propo- 
sition 11.43 of [Kna93]). For these two reasons it is natural to conceive of 
the function field as being associated with the projective curve rather than 



+00 if / = 0, 
e if / = t e u. 
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with an affine piece. That is, even when C is projective we can write k(C), 
interpreting it as the function field of an affine piece of C. The notation Ch om 
to distinguish a projective curve will no longer be used. 

For example, let £ be an elliptic curve over k and consider the rational 
functions x and y, elements of k(£). These make sense as mappings on the 
affine part of £. To study them at the infinite point Of: = [0, 1,0], homogenize 
them to x/z and y/z and then dehomogenize to x/z and 1/z, introducing the 
tildes to emphasize the different coordinate system. In these coordinates the 
Weierstrass polynomial is E'(x, z) — z — Ax 3 + gixz 2 + (fez 3 . This has nonzero 
second partial derivative at (0,0), showing that a; is a uniformizer at Of, i.e., 
vq £ (x) = 1. The Weierstrass relation becomes 

z(l + g 2 xz + g 3 z 2 ) = Ax 3 , 

and so Proposition 7.2.4 shows that vq £ (z) = 3. Thus the rational functions 
x and y have valuations vq £ (x) = uq £ (x/z) = 1 — 3 = — 2 and vo £ (y) = 
vq £ (1/z) = —3, making x/y a uniformizer at 0g. Working over the complex 
numbers and thinking of x and y as p and p', this analysis reflects their double 
and triple poles at the origin. 

To summarize the flow of ideas so far: Lemma 7.2.2 translates the geo- 
metric data that a nonsingular algebraic curve has a one-dimensional tangent 
line at each point into algebraic information that a vector space associated to 
each point is one-dimensional. Proposition 7.2.3 says that after localizing, this 
makes a maximal ideal at each point principal. The Noethcrian property of 
the local ring then leads to a valuation at each point. The uniformizer and the 
valuation show that the curve's rational functions give well defined mappings 
on the entire curve. 

Different representations of a rational function F may be required to eval- 
uate it at different points. The idea is that an expression giving 0/0 at a point 
yields the correct value after its numerator and denominator are divided by 
their common power of a uniformizer. For example, consider the function x/y 
on the elliptic curve over Q with Weierstrass equation E : y 2 = Ax 3 — Ax. To 
determine its behavior at the point P = (0, 0), recall the discussion earlier that 
y is a uniformizer at P. This shows that x = y 2 /(Ax 2 — 4) has order 2, giving 
a simple zero of x/y at P. Equivalcntly, x/y rewrites as y/(Ax 2 — 4) (here is 
where the numerator and denominator get divided by the uniformizer), again 
giving a simple zero at P. On the other hand, the value of the new expres- 
sion y/(Ax 2 — A) for x/y is now longer immediately clear at the points (±1,0) 
of the curve. 

For a more elaborate example, let £ be any elliptic curve over any field k as 
in Section 7.1, and let P be any nonzero point of £. The translation function 

t p :£—>£, t p (X)=X + P 

has as its components the rational functions F(x,y) = r(xp,yp,x 7 y) and 
G(x,y) = s(xp,yp,x,y) from Section 7.1, using the general case expressions 
for the slope and intercept A and /i, 
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\ y-yp xy P - x P y 
A = , ii = . 

X — Xp X — Xp 

These expressions for A and /i are not well defined when x = xp, but multiply- 
ing the numerators and denominators by y+yp and then using the Weierstrass 
equation (7.1) to substitute for y 2 and yp and then canceling x — xp shows 
that the general expressions for A and /x are also (Exercise 7.2.7) 

_ A(xp + x P x + x 2 ) - g 2 _ -4(x P x + x P x 2 ) - g 3 + y P y 
A — , /I — . 

yp + y yp + y 

These are defined when x = xp provided y ^ —yp, when if also y = yp 
then they are the special case expressions for A and /i from Section 7.1. The 
new expressions are no panacea, however, since they are not well defined 
when y = —yp even when x =/= xp. Nonetheless, the point is that despite 
their casewise expressions, the components F and G of Tp are single rational 
functions on £. 

As explained at the beginning of this section, we need a general result from 
algebraic geometry that curves are determined by their function fields. The 
precise statement requires appropriate equivalence relations both on curves 
and on fields. 

To define the equivalence relation on curves, let C be a nonsingular projec- 
tive algebraic curve over k. Let r be a positive integer. Consider any element 
ofP-(k(C)), 

h=[F 0 ,...,F r ]. 

For each point P e C let tp be a uniformizer at P and let vp(h) = 
min{^p(F 0 ), . . . , vp{F r )}. Then h defines a map from C to P r (k), 

h(P) = [(t-/ p(h) F Q )(P), . . . , (tp vpW F r )(P)], PeC. 

Let C C P r (k) be another nonsingular algebraic curve over k. Then ft, is a 
morphism from C to C if h(P) G C for all PeC. The affine notation is 
h = (F 1 /F 0 , . . . , F r /F 0 ) € k(C) r when F 0 ^ 0, but this needn't map points P 
of the affine piece of C back to affine space when Fq(P) = 0. For instance, 
every element of k(C) defines a morphism from C to P 1 (k), and the example 
Tp from a moment ago is a morphism from £ to £. A morphism h from C 
to C is an isomorphism if there exists a morphism h! from C to C such that 
the composites h! oh and h o h! are the respective identity maps on C and C, 
in which case C and C are isomorphic. 

Isomorphism is an equivalence relation but it is not fine enough for our 
purposes. Again letting C be a nonsingular projective algebraic curve over k, 
an element F of k(C) is defined over k if it has a representation F = f/g with 
/ and g from k[C] = k[xi, . . . , x n ]/Ik (where Ik = I fl k[xi, . . . , x n ]) rather 
than k[C]. Such functions form a subficld of k(C), 



k(C) = {Fe k(C) : F is defined over k}. 
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A morphism h = (F\, . . . , F r ) of curves over k is defined over k if each Fi 
is. For instance, every element of k(C) defines a morphism over k from C 
to P 1 (k), and Tp is a morphism over k. The equivalence relation we need is 
that two nonsingular projective curves C and C over k are isomorphic over k 
if they are isomorphic via morphisms h and h! over k. See Exercise 7.2.8 for 
an example. In the case of two elliptic curves the only possible isomorphisms 
over k taking Of to Of turn out to be the admissible changes of variable 
from Section 7.1. As affine maps these take lines to lines, making them group 
isomorphisms as well. Two curves defined over a field k can be isomorphic 
over k without being isomorphic over k itself. Exercise 7.2.9 gives an example. 

To define the needed equivalence relation on function fields, call a field K 
a function field over k if 

(1) Knk = k, 

(2) K is a finite extension of a field k(t) where t is transcendental over k. 

Two function fields K and K' over k are conjugate over k if there exists a 
k- isomorphism K — > K', meaning an isomorphism that fixes k pointwise. 
The relation between curves and their function fields is 

Theorem 7.2.5 (Curves Fields Correspondence, Part 1). The map 

C k(C) 

induces a bijection from the set of isomorphism classes over k of nonsingular 
projective algebraic curves over k to the set of conjugacy classes over k of 
function fields over k. 

The importance of Theorem 7.2.5 is that rather than constructing algebraic 
curves directly it now suffices to construct suitable fields instead. 

The theorem gives the map from curve-classes to field-classes explicitly. To 
describe the map from field-classes to curve-classes, let K be a function field 
over k. Since the extension K/k(t) is finite and we are in characteristic 0 the 
Primitive Element Theorem of field theory says that K = k(i, u) for some u. 
The generator u satisfies an irreducible polynomial over k(i), and clearing 
denominators gives a polynomial relation between t and u over k, 

<p(t,u)=0, f£k[x,y}. 

Because Knk = k, the polynomial ip is irreducible over k (Exercise 7.2.10). 

2 

The set of points (x,y) € k such that <p(x,y) — 0 gives a plane curve C", 
but this curve can have finitely many singular points, and at these points 
rational functions need not define mappings. A process called desingulariz- 
ing (see Chapter 7 of [Ful69]) produces a nonsingular curve C with function 
field K. The map (t, u) takes C to the plane curve C , but this map is only 
guaranteed to be a birational equivalence, not necessarily an isomorphism. In 
general given two curves C and C", possibly containing singular points, a ra- 
tional map from C to C is an clement h of P r (k(C)) like a morphism but 
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only required to define an actual map from all but finitely many points of C 
to C . The rational map is a birational equivalence if there exists a rational 
map b! from C to C such that the two composites are the identity maps on C 
and on C wherever they are defined. See Exercise 7.2.11 for an example of 
birational equivalence between a singular curve and a nonsingular curve. A 
nonsingular curve corresponding to the field K need not lie in the plane, and 
this is why Definition 7.2.1 is not restricted to plane curves. In any case, with 
Theorem 7.2.5 in hand we can think of curves in terms of their function fields. 

Theorem 7.2.5 extends from curves and fields to maps between curves and 
maps between fields. If h : C — > C is a nonconstant morphism over k of 
nonsingular projective curves over k then as in the compact Ricmann surface 
environment, h is surjective. Its pullback is a map of function fields, 

/i*:k(C") — ►k(C), h*G = Goh. 

This isn't quite as simple as it looks, the fact that h* is well defined relying on 
a result about polynomial ideals and their solution sets (Exercise 7.2.12(a)). 
Another such result shows that if h is only assumed to be a nonconstant 
rational map then its pullback still makes sense (Exercise 7.2.12(b)). 

Theorem 7.2.6 (Curves Fields Correspondence, Part 2). LetC andC 
be nonsingular projective algebraic curves over k. Then the map 

h^h* 

is a bisection from the set of nonconstant morphisms over k from C to C to 
the set of \l- injections ofk(C') in k(C). 

Theorem 7.2.6 says that rather than constructing morphisms between 
curves it now suffices to construct suitable field injections instead. For ex- 
ample, since any nonconstant rational map h between nonsingular projective 
curves C and C defines a corresponding k-injection h* of k(C") in k(C), the 
theorem shows that in fact h is a morphism. In particular a birational equiv- 
alence between nonsingular projective curves is an isomorphism. For another 
example, consider the injection i : k(x) — 5- k(x, y) where x and y satisfy a 
Weierstrass equation. Letting £ denote the relevant elliptic curve, the mor- 
phism corresponding to i is h : £ — > P*(k) where h(x,y) = x. 

The theorem gives the map from morphisms to field injections explicitly. 
To describe the map from field injections to morphisms, let i : K' — > K be a 
k-injection of function fields. By Theorem 7.2.5 there exist nonsingular projec- 
tive curves C and C such that k(C") = K' and k(C) = K (k-isomorphisms), 
and i can be viewed instead as a k-injection 

i-.k(C') — >k(C). 

Define two points P and Q of C to be k(C")-cquivalcnt if the embedded 
image of k(C") in k(C) doesn't separate them, i.e., (iG)(P) = (iG)(Q) 
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for all G € k(C"). Then the quotient set of k(C")-equivalence classes in C 
bijects to the curve C", the equivalence class of P € C mapping to the 
unique point P 1 e C such that G(P') = (iG)(P) for all G. The morphism 
h : C — 5- C corresponding to i is the quotient map followed by the bijection. 
For instance, in the example ending the previous paragraph, pairs (x, ±y) 
in £ are k(x)-equivalent and the morphism h(x, y) — x identifies them in 
mapping £ to P 1 (k). We will use Theorem 7.2.6 and the quotient morphism 
in Section 7.8 to revisit isogenics of elliptic curves in algebraic terms. 

Exercises 

7.2.1. The polynomial ip(x, y) = x 2 — 2y 2 is irreducible over Q. Docs it define 
an algebraic curve over Q? Describe its solution set in P 2 (Q). 

7.2.2. Show that rap is a maximal ideal of k[C]. 

7.2.3. (a) In the proof of Lemma 7.2.2, show that Vip(P) is in the row space 
of [Djifi(P)]. (Hints for this exercise are at the end of the book.) 

(b) In the proof of Lemma 7.2.2, show that the pairing is perfect. 

(c) Let £ be an elliptic curve defined by a Weierstrass polynomial E, and 
let P = (x,y) be a point of £. Explain how the proof of Lemma 7.2.2 shows 
that x — xp + mp spans mp/mp if D 2 £ ; (P) ^ 0 and y — yp + mp spans 
mp/m 2 p if DxE{P) ^ 0. 

(d) In the discussion before Proposition 7.2.3, show that mp n Mp = mp. 

7.2.4. Let C be an affine algebraic curve as in Definition 7.2.1. Supply details 
as necessary in the following argument that the condition rank([Z)j</9j(P)]) = n 
for some P € C is impossible: If the rank is n then the method of proving 
Lemma 7.2.2 shows that mp — mp and so Mp = Mp. By Nakayama's Lemma 
Mp = 0 and so m P — 0. This forces k[C] = k and then k(C) = k, contradic- 
tion. 

7.2.5. A special case of the Hilbert Basis Theorem (see, for example, [Ful69]) 
says that the ring k[xi, . . . , x n ] is Nocthcrian, meaning that all ideals are 
finitely generated, or equivalently that there is no infinite chain of proper 
increasing ideal containments I\ C h C ^3 C • • • in \s[x\, . . . , x n ]. 

(a) Show that the two conditions are equivalent. 

(b) Let / be an ideal of k[xi, . . . , x n ]. Show that the ideals of the quotient 
k[xi, . . . ,x n ]/I arc in bijective correspondence with the ideals of k[xi, . . . ,x n ] 
that contain /. Conclude that k[xi, . . . , x n ]/I is Noetherian. In particular k[C] 
is Noetherian for an algebraic curve C. 

(c) For any ideal J of k[C]p the ideal Jn k[C] of k[C] is finitely generated 
by part (a). Show that its generators also generate J and therefore the local 
ring k[C]p is Noetherian. 

7.2.6. Show that the valuation at P on k(C) is well defined and prove Propo- 
sition 7.2.4. 
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7.2.7. Derive the new expressions for A and n in the section. 

7.2.8. Working over Q, let <p(x, y) = xy — 1 and <p'(z, w) = z 2 + w 2 — 1, and 
let C and C be the corresponding curves. Let 

The polynomials homogenize to ip(x,y,s) = xy — s and to ip'(z,w,t) = 
z 2 + w 2 —t 2 . Explain why the maps homogenize to (F, G) [x, y, s] — [2xs, x 2 — 
s 2 ,x 2 + s 2 ] and (F',G')[z,w,t] = [z 2 , (t - w) 2 , z(t - w)], and why these maps 
are 

{F,G)[x,y,s] = [2a,x-y,x + y], (F',G')[z,w,t] = [t + w,t-w,z]. 

Show that (F,G) takes C to C and that (F',G') takes C to C, including 
infinite points. Show that the maps invert each other. Thus the curves are 
isomorphic over Q. In particular, the sets of rational points on the hyperbola 
and the circle are isomorphic as algebraic curves. 

7.2.9. Consider the curves £ and £' defined by the Weierstrass relations y 2 = 
Ax 3 — Ax and y 2 = Ax 3 + Ax. Show that £ and £' are isomorphic as curves 
over Q by finding an admissible change of variable over Q between them. 
Show that £ and £' are not isomorphic as curves over Q either by showing 
that no appropriate admissible change of variable exists or by comparing their 
subgroups of 2-torsion points with rational coordinates, £[2](Q) and £'[2](Q). 

7.2.10. Working inside a fixed algebraic closure of k(i), let K = k(i, u) be 
a field as in Theorem 7.2.5 and let / G k(i)[y] be the minimal polynomial 
of u over k(t). After clearing denominators the relation f(u) = 0 rewrites as 
a polynomial relation between t and u over k, ip(t,u) = 0 where ip £ \c[x,y]. 
The text asserts that (p is irreducible over k because Knk = k. Supply details 
as necessary for the following argument that this is so: If (p = ipiifi2 over k 
then this factorization occurs over a finite extension field of k, necessarily of 
the form k(v) where g(v) = 0, g £ k[z] is irreducible. Also g is irreducible 
in k(t)[z] since its roots lie in k, so that the coefficients of any factorization 
would lie in k(i) n k = k. Let 1 = k(t,v). Thus [K : k(t)] = deg(/) and [1 : 
k(t)] = deg(g). Consider the composite field Kl = k(t, u, v). The factorization 
of ip over k(v) gives a factorization of / over 1, and this makes the extension 
degree [Kl : 1] less than dcg(/). Thus [Kl : k(t)] < deg(/) deg(^) and [Kl : 
K] < deg(g), and consequently g = g\g2 in K[z]. Since the roots of g lie in k, 
the factorization of g occurs in (K n k)[z] — k[z], giving a contradiction. 

7.2.11. Let (p(x,y) = y 2 — x 3 — x 2 , and let C be the corresponding curve. 
Show that C is an algebraic curve in the sense of Definition 7.2.1 but that it 
has a singular point at (0, 0). Let 



F(x,y) = y/x, (G, H) (t) = (t 2 - 1 , t 3 - t) . 



268 7 Modular Curves as Algebraic Curves 



Show that F and (G, H) are birational equivalences between C and k but not 
isomorphisms. 

7.2.12. (a) Let I e k[xi, . . . , x n ] and /' e k[j/i, . . . , y r ] be prime ideals defin- 
ing nonsingular projective algebraic curves C and C. Using projective nota- 
tion, let 

h = [h 0 + /,..., h r + I], each hi + I e k[C] 
be a nonconstant (and hence surjective) morphism from C to C, and let 

G = [.go + /' , 9i + I'} , each 9i + I' e k [C] , 5o + J V /' 

be an clement of the function field k(C"). The pullback h*G = G o h E k(C) 
is 

= [9o ° (fto, • ■ • , ftr) + 1, 9i ° (fto, • • ■ , K) + I]- 

But to make sense this must be independent of how the various coset represen- 
tatives are chosen. Show that replacing any hi by hi + if where tp S I doesn't 
change the definition of h*G. Show that replacing either by gi + ip' where 
ip' G /' doesn't change the definition of h*G provided that for any polynomial 
ip e k[xi, . . .,x n ], 

ip(P) = 0 for all P e C =^> ip el. 

The definition of C as the solution set of I immediately gives the opposite 
implication, but this also follows using the fact that I is prime. For the details 
see the first few pages of an algebraic geometry text. 

(b) Now make the weaker assumption that h is a nonconstant rational 
map from C to C . Show that verifying that h*G is well defined now requires 
the additional implication 

ip(P) = 0 for all but finitely many P e C ip(P) = 0 for all P e C. 

Again, see an algebraic geometry text. 

7.3 Divisors on curves 

This section begins by citing some results from Riemann surface theory in 
the new context of algebraic curves. Every compact Riemann surface has a 
description as a nonsingular projective algebraic curve over C. The meromor- 
phic functions on a compact Riemann surface are the rational functions on the 
surface viewed as a curve, and the holomorphic functions between compact 
Riemann surfaces are the morphisms between the surfaces viewed as curves. 
Many results about Riemann surfaces rephrase algebraically and then apply 
to algebraic curves over an arbitrary field k, although sometimes the charac- 
teristic of k needs to be taken into account when it is not 0 as we assume it 
is here. 
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For instance, as mentioned in the previous section, a morphism of nonsin- 
gular projective algebraic curves is either constant or surjectivc. If h : C — > 
C is a nonconstant morphism then its pullback is a k-injection of function 
fields, h* : k(C") — > k(C), and the degree of h is defined as the field extension 
degree 

deg(fc) - [k(C) : h*(k(C'))]. 

Thus a morphism is an isomorphism if and only if its degree is 1 (Exer- 
cise 7.3.1(a)). The formula shows that the degree of a composition of non- 
constant morphisms is the product of the degrees, and this extends to all 
morphisms if constant morphisms are given degree 0. 

The degree of h is the number of inverse images under h of each point 
of C suitably counting an algebraically defined ramification. Specifically the 
ramification degree of ft. at a point P G C is the positive integer 

ep(h) = fp(t' o h) where t' is a uniformizer at h(P). 

Thus t' oh = t ep ^F where t is a uniformizer at P and F G k(C) with 
vp(F) = 0. The resulting degree formula is analogous to the result in Riemann 
surface theory, 

e P (h) = deg(/i) for any Q G C . (7.4) 

Peh- 1 (Q) 

(As a special rational function has only finitely many zeros and poles.) 

For morphisms h : C — > C over k of curves over k, the methods of Exer- 
cise 7.2.10 show that (Exercise 7.3.1(b)) 

[k(C) : ft*(k(<7'))] = [k(C) : h*mC))}, 

so that deg(h) is defined by either of these. 

In particular, let £ be an elliptic curve over k and let N be a positive 
integer. We assert without proof that since k has characteristic 0 the map 
[N] : £ — > £ is unramified and has degree ./V 2 , 

[k(£)--[NT(M(£))] = N 2 . (7.5) 

The terminology of divisors carries over from Riemann surfaces to curves. 
Let C be a nonsingular algebraic curve over k. The divisor group of C is the 
free Abelian group on its points, 

Div(C) = < Y n p( p ) ■ n P € Z, n P = 0 for almost all P I . 

[pec J 

Since both an elliptic curve and its divisor group have group laws, we have 
adopted the notational convention of parenthesizing points in divisors, thus 
the divisor (P) versus the point P and more generally in the case of an elliptic 
curve 
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np(P) E Div(£) versus ^[n P ]P E £. 

Coefficients n P in the divisor on the left can be negative, so the addition in £ 
on the right is using the definition [np]P = — [— np]P when np < 0. Returning 
to a general curve, the subgroup of degree-0 divisors is 

Div°(C) = tjT n P (P) e Div(C) : ^ n P = °} • 

The divisor of a nonzero element of the function field of C is 

div(F) = 5> P (F)(P), fek(C)'. 

Divisors of the form div(_F) are called principal and the set of principal divisors 
is denoted Div £ (C). Principal divisors have degree 0. This follows from another 
formula analogous to Riemann surface theory, 

div(F)= J2 ^( F )( p )- E e ^xn 

and from (7.4), combining to show deg(divF) = deg(F) — deg(F) = 0. Part (1) 
of Proposition 7.2.4 shows that the map 

div : k(C)* — > Div°(C) 

is a homomorphism. Thus Div (C) = div(k(C)*) is a subgroup of Div 0 (C). As 
in Chapter 6 the (degree zero) Picard group of C is the (degree zero) divisor 
class group, the quotient 

Pic°(C) =Div°(C)/Div^(C). 

We will need the following result in Chapter 8. 

Proposition 7.3.1. Let C be a nonsingular projective algebraic curve over a 
field k. Let S C C be a finite subset. Then every divisor class in Pic°(C) has 
a representative J2 n p(P) suc ^ that np = 0 for all P G S. 

Proof. We may assume that all of S lies in affine space k™. To see this, note 
that any matrix A e GL n+ i(k) defines an automorphism of P"(k) just as 
2-by-2 matrices act on the Riemann sphere C = P 1 (C). Each such automor- 
phism restricts to an isomorphism from C to another curve C taking S to 
a set S . Choose vectors vp e k that represent the points P e P™(k) 
of S. There is a vector a n+1 G k that is not orthogonal to any vp (Excr- 

cise 7.3.2(a)). Extending to a basis {oi, . . . , a n +\} of k and then making 
these the rows of A gives an isomorphism of C taking S into k as desired. 
With this done, all points of S fall under the purview of an affine description 
of the function field k(C) even though C is projective. 
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Recall from Section 7.2 the coordinate ring of G and its localization at P, 
k[G] C k[C]p, and recall their maximal ideals of functions that vanish at P, 
mp C Mp. For each point P of S define an intermediate ring and maximal 
ideal to avoid poles in S, 

k[C]s - If/ 9 G k(C) : g(Q) ± 0 for all Q E S}, 
Ms.p = {f/g e k[C] s : /(P) - 0}. 

Thus ui p C Ms^p C Mp, and so there is a natural map 

mp/mp — ► M s ,p/Mg tP — >■ M P /Mp. 

Section 7.2 showed that the composite is an isomorphism of nontrivial vector 
spaces, so the middle quotient is nonzero. Choose any Gp E M S . P — M| p . 
Then Gp € Mp. The argument given in the hint to Exercise 7.2.3(d) modifies 
easily to show that M s ,p n Mp = M| p (Exercise 7.3.2(b)), so G P ^ Mp. 
This shows that vp{Gp) = 1. By the Chinese Remainder Theorem there is a 
surjective map 

k[C] s — ► k[C] s /M|p x J] k[C]s/M SiQ . 

In particular, some Fp G k[C]s maps to (Gp, 1, . . . , 1). This function Fp 
therefore has valuations 

v P {Fp) = \, u Q {F P ) = Q for all Q e 5 - {P}. 

Now the result is clear. Take any divisor in Div (C). Adding the divisor of a 
suitable product IlpeS Fp p gives an equivalent divisor at the level of Pic°(C) 
but with n P = 0 for all P E S. □ 

If ft, : C — > C is a nonconstant morphism then it induces forward and 
reverse maps of Picard groups, 

K ■ Pic°(C) — ► Pic°(C") and h* : Pic°(C") — > Pic°(C), 

given by 

K{[Y,n P (P)}) = [^np(MP))] 
p p 

and 

^([E^w)])-tE^ E epW(p)]. 

Q Q Peh-^Q) 

Here the square brackets denote equivalence class modulo Div (G) or Div £ (G') 
as appropriate. These maps were denoted hp and h p in Section 6.2. They con- 
tinue to make sense in the algebraic geometry environment because at the level 
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of divisors they take degree-0 divisors to degree-0 divisors and they satisfy 
/i*(div/) = div(norm; l /) where norm is defined as in Section 6.2 (cf. Exer- 
cise 6.2.4) and h*(divg) — div(h*g) where this last h* is the pullback. Also 
as in Section 6.2, mapping backwards and then forwards multiplies by the 
degree, 

h* o h* = deg(/i) onPic°(C"). 

For example, let £ be an elliptic curve over k and consider a first degree 
function in k(£), 

F(x, y) — ax + by + c, a, b, c G k, a or b nonzero. 

This function has zeros at the affine points where the line with equation 
F(x, y) = 0 meets £, and it has a pole at Of of the right order for its divisor 
to have degree 0. Letting P, Q, R denote the points where the line meets £, 
allowing repeats and allowing Of, in all cases 

div(F) = (P) + (Q) + (R) - 3(0 £ ). 

Specializing to F{x, y) = x, the vertical line meets the affine part of £ twice, 
showing again that vo E (x) = —2 as in the previous section; specializing to 
F(x, y) — y, the horizontal line meets the affine part of £ three times, showing 
again that i/Q £ (y) = —3. Thus a uniformizer at Of is x/y as before. 

In particular, let £ be the elliptic curve over Q with Weierstrass equation 
E : y 2 — 4x 3 — Ax. Consider the rational function 

F(x,y)= V - e Q(£). 
x 

The numerator y of F vanishes at the points (±1,0) of £, where the denomi- 
nator x is nonzero and so F has zeros. To compute their orders, note that y 
is a uniformizer at these points since DxE{±.l, 0) ^ 0, making V(±\.q){F) = 1. 
The numerator y is also a uniformizer at the point (0, 0) of £ where the de- 
nominator x vanishes, but as in the previous section F(x, y) = (4x 2 — 4)/y as 
well and F has a pole at (0, 0) with V(o t o)(F) = — 1. There are no other zeros 
or poles at affine points of £. At the infinite point Of the previous paragraph 
shows that F has a pole and v 0e (F) = — 1. Thus 

div(F) - -(0, 0) + (1, 0) + (-1,0) - (Of). 

Section 1.4, especially Exercise 1.4.3, shows that this is essentially the same as 
the divisor of the meromorphic function F(z) — p\{z) / 'pi(z) on the complex 
elliptic curve C/Ai as shown in Section 3.4. See Exercise 7.3.3 for a similar 
example. 

The next section will redefine the Weil pairing for an elliptic curve £ over 
a field k. Doing so requires a characterization of principal divisors on the 
curve, to be derived from a small calculation. Consider two distinct points P 
and Q of £. The claim is that there is no rational function F e k(£) with 
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divisor (P) — (Q). This can be shown using general machinery since such a 
map would have to be an isomorphism from the genus 1 curve £ to the genus 0 
curve P 1 (k) and genus is an algebraic invariant (or cf. Exercise 6.1.3), but we 
give an elementary proof here. Given such a function F, considering F o tq 
(where again tq(X) = X + Q) shows that we may assume Q = Of , i.e., 
div(P) = (P) - (Of). The function F takes the form (Exercise 7.3.4(a)) 

F{x,y)=F 1 {x)y + F 2 {x), F 1 ,F 2 ek[x}. (7.6) 

At an affine point P of £ whose y-coordinate is nonzero, (7.6) gives the second 
equality in the calculation 

v R {F x {x)) = MFi(x)y) = M(F(x,y) - F(x,-y))/2) 

> mm{u R {F{x, y)),v R (F(x, -y))} > 0. 

At an affine point R whose y-coordinate is 0, vn{y) — 1 and vr(x) is 0 or 2, 
making u R (Fi(x)) even for i = 1,2 (Exercise 7.3.4(b)) and so VR(F 1 (x)y) and 
vr(F 2 (x)) are distinct. This gives 

0 < y R {F) = min{i/ fl (Fi(a;)) + l,v R {F 2 (x))} < MF^x)) + 1, 

and again vr(Fi{x)) > 0 since it is even. At the projective point Of, 
vo £ (x) = —2 and vv £ {y) = —3 as shown above, so vo £ (Fi(x)) is even for 
i = 1,2 (Exercise 7.3.4(b) again) and the relation vo e {F) = — 1 is 

min{i/ 0£ {Ft (x) ) - 3 , v Qs {F 2 (x) ) } = - 1 , 

forcing v 0e (Fi(x)) = 2. Thus X)pe£ v p{F\{x)) > 0, contradicting the fact 
that principal divisors have degree 0. It follows that no F exists as stipulated. 
These examples set up 

Lemma 7.3.2. Let £ be an elliptic curve over k. 

(a) For any points P,Q e £, 

the divisor (P) — (Q) is principal P = Q. 

(b) For any points P,Q e £ there is a congruence of divisors 

(P) - (0 £ ) + (Q) - (0 £ ) = (P + Q) - (0 £ ) (mod Div £ (£)). 

Proof, (a) The implication ( =>• ) was shown in the calculation above. The 
other implication is immediate. 

(b) Let the line through P and Q have equation ax + by + c = 0 and 
let R = — P — Q be the third collinear point of £ . The formula div(P) = 
(P) + (Q) + (R) - 3(0g) for F(x, y) =ax + by + c shows that 

(P) - (0 £ ) + (Q) - (0 5 ) + (P) - (Of) e Div £ (£). 
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If R = Of this gives the result. Otherwise let the vertical line through R 
and — R have equation x + c = 0. Changing the example to F(x, y) = x + c 
shows that 

(R)-(0 £ ) + (-R)-(0 £ )eBiy i (£), 

and since — R = P + Q the result follows from subtracting the two displayed 
relations. □ 

Recall that in the complex analytic environment of Chapter 6 the map of 
Abel's Theorem followed by the natural identification of the Jacobian of an 
elliptic curve with the curve itself gave a composite isomorphism from Pic°(£ ) 
to £. We can now prove an algebraic version of this, no longer requiring k = C. 

Theorem 7.3.3. Let £ be an elliptic curve. Then the map 

Div(£) — > £ , Y, np ( p ) ^ Y} np ^ P 
induces an isomorphism 

Pic 0 (£)^>£. 

Therefore the principal divisors on £ are characterized by the condition 

Y n p{ p ) e Div*(£) Y np = 0 and H[ np ] P = ° £ ■ 

Proof. The map Div(£) — > £ of the theorem is clearly a homomorphism. Its 
restriction to Div°(£) is surjective since (P) — (Of) i->- P for any P S £. The 
restriction has kernel Div (£) because for a degree-0 divisor ~Yl,np(P), 

Y[ n p] p = °£ (H[" p ] P ) ~ ( 0f ) e Div£ (^) by Lemma 7.3.2(a) 

np (( P ) ~ e Div£ ( f ) b y Lemma 7.3.2(b) 

^ n P (P) e Div £ (£) since n P = 0. 

□ 

If /i : £ — > £' is a morphism of elliptic curves over k such that /i(0f ) = Of 
then the isomorphisms Pic°(£) £ and Pic (£' ) £' of Theorem 7.3.3 
and the fact that h* : Pic°(£) — ► Pic°(£') is a homomorphism combine to 
show that h is a homomorphism as well (Exercise 7.3.5). The characterization 
of principal divisors on elliptic curves in the theorem generalizes the complex 
analytic criterion from Exercise 1.4.1(c). 

Exercises 

7.3.1. (a) Show that the degree of a composition of morphisms over k is the 
product of the degrees, and so such a morphism is an isomorphism if and only 
if its degree is 1 . 

(b) Let h : C — > C be a nonconstant morphism defined over k. Show 
that [k(C) : /i*(k(C"))] = [k(C) : h*(k(C'))}. 
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n-\-l 

7.3.2. (a) Let {vp : P € S} be a finite set of nonzero vectors in k . Show 

n+l 

that there exists some vector a € k that is not orthogonal to any vp. (A 
hint for this exercise is at the end of the book.) 
(b) Show that M S ^ P n Mp = Af| p . 

7.3.3. Compute the divisor of the rational function F(x,y) — y/x on the 
elliptic curve y 2 = 4a; 3 — 4. Use the results of Exercise 1.4.3 to verify that it 
corresponds to the divisor of the meromorphic function F(z) = p^ 3 (z) / p^ 3 (z) 
on the complex elliptic curve C / A^ 3 . 

7.3.4. (a) Letting £ be defined by a Weierstrass equation (7.1), show that any 
rational function F <G k(£) takes the form (7.6). (A hint for this exercise is at 
the end of the book.) 

(b) Show that if fp(x) is even then so is vp(F(x)) for any rational func- 
tion F. 

7.3.5. Let h : £ — > £' be a morphism of elliptic curves over k such that 
h(Qs) = Of/. Consider the diagram 

Pic°(£) >£ 

h, 

Pic°(£') 

where the top row is [(P) — (Oe)] ^ P and similarly for the bottom row. Show 
that the diagram commutes and therefore h is a homomorphism. 



7.4 The Weil pairing algebraically 

To define an algebraic version of the Weil pairing using Theorem 7.3.3 let fi N 
denote the group of iVth roots of unity in k, 

Hn = {i£k:i A ' = l}. 

Since char(k) = 0 this can be viewed as the usual cyclic group of order N 
in Q. Recall that £[N] = (Z/NZ) 2 . Construct the Weil pairing, 

e N : £[N] x £[N] — »■ » N , 

as follows. Let two iV-torsion points P, Q e £[N] be given, possibly P = Q. 
By the characterization of principal divisors in Theorem 7.3.3 there exists a 
function / = /q £ k(£) such that 



div(f)=N(Q)-N(Q £ ). 
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(The notation F = f/g is no longer needed, so rational functions are denoted 
by lower case letters from now on.) Since [N] : £ — > £ is unramificd, the map 
/ o [N] has divisor 

div(/o[7V])= ]T N(R)- N W- 

R:[N]R=Q S:[N]S=0 S 

If Q' G £[A^ 2 ] is any point such that [N]Q' = Q then the divisor is 
div(fo[N]) = N ]T (Q' + S)-(S). 

S££[N] 

Again by Theorem 7.3.3, this time applied to the sum on the right, the equality 
takes the form div(/o [N]) = Ndiv(g) for some g — gq e k(£). Thus fo [N] = 
g N after multiplying g by a nonzero constant if necessary. For any point X E £, 

g(X + P) N = f([N]X + [N]P) = f([N)X) = g(X) N . 

Consider the rational function g(X + P)/g(X) € k(£). Since its A^th power 
is 1 its image is a subset of fx N , making the map constant since it doesn't 
surject from £ to P 1 (k). The image point is the Weil pairing of P and Q, 

Proposition 7.4.1 (Properties of the Weil pairing). 

(a) The Weil pairing is bilinear, 

e N (aP + bP',cQ + dQ') - e N (P, Q) ac e N (P, Q') a d e N {P', Q) bc e N (P', Q') bd . 

(b) The Weil pairing is alternating and therefore skew symmetric, 

e N (Q,Q) = l, e N (Q,P) = e N (P,Q)- 1 . 

(c) The Weil pairing is nondegenerate, 

ife N (P, Q) = 1 for all P e £[N] then Q = 0 £ . 

Consequently the Weil pairing surjects to fi N . 

(d) The Weil pairing is Galois invariant, 

e N {P,QY = e N (P a ,Q a ) for all a e Gal(k/k). 

(c) The Weil pairing is invariant under isomorphism, i.e., if i : £ — > £' is 
an isomorphism and e' N is the Weil pairing on £' then 

e' N {i{P),i{Q)) = e N (P, Q) for all P,Q e £[N]. 
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Proof, (a) For linearity in the first argument let g = Qq and compute 
g(X + P,+ P 2 ) g(X + P, + P 2 ) g(X + P 2 ) 



e N (Pi + P2,Q) 



g{X) ~ g(X + P 2 ) g(X) 

e N (Pi,Q)e N (P2,Q), 



using X+P 2 for X to evaluate e^iPi, Q)- For linearity in the second argument, 
let /i, f 2 , /3, and gi, g 2 , g% be the functions associated to Qi, Q 2 , and Qi + Q 2 - 
Choose h £ k(£) with divisor 

div(fc) = (Qi + Q 2 ) - (Qi) - (Q 2 ) + (0 £ ). 

Then div(/ 3 /(/i/ 2 )) = ATdiv(/i), so / 3 = chf 2 h N for some c e k*. Thus 
h ° [iV] = c(/i o [JV])(/ 2 o [AT](/i o [7V]) W , or ff f = c( 3l52 (fe o [AT]))*, or 
.93 = c'gig 2 (h o [N]). Using this, compute 

P (pn i n \ g3{X ± P) gi{X ± P)g2(X ± glMffi + M 
e w (P, Qr + Q 2 ) - -^j- - ffl( x) 52 (X)M[iV]X) 

= e 7 v(P,Q 1 )e A r(P,Q 2 ). 

(b) To show ejv(Q, Q) = 1 for all Q £ £[N], let / = /q and g = gq and 
compute 

AT-1 JV-l 

div( [] /(* + WO)) - E ^(t 1 - n lQ) - ^([-«]0) = °- 

n=0 n=0 

Hence Ilri^o 1 /(-^ + N<3) i s constant, and so if Q = [N]Q' then also 
rin=o 1 5(^ + MQ') 1S constant because its TVth power is the previous prod- 
uct at [A?]X. Equating the second product at X and X + Q' gives g(X) = 
g(X + [N]Q') = g(X + Q), and e N (Q, Q) = g{X + Q)/g(X) = 1 as desired. 
Every alternating bilinear form is skew symmetric (Exercise 7.4.1(a)), giving 
the rest of (b). 

(c) For nondegeneracy, if cn(P,Q) = 1 for all P £ £[N] and g = gQ 
then g(X + P) = g(X) for all P £ £[N}. This periodicity implies that g = 
ho [N] for some function h, but we don't yet know that h is rational. Let r* : 
£[N] — > Aut(k(£)) map each TV-torsion point P to its associated translation 
automorphism t p taking f(X) to f(X + P) for all / £ k(£). Then r* is 
a homomorphism. If P £ ker(r*) then in particular rp acts trivially on the 
function fp with divisor N(P) —N^Og) as in the definition of the Weil pairing, 
so f P (0e) = Tpf P (0 £ ) = f P (P) and this forces P = 0 e . Thus r* injects, k(£) 
is a Galois extension of its subfield fixed by t*(£[N]), and the Galois group 
is isomorphic to £[N]. But the fixed field contains the subfield [A]*(k(£)) = 
{h o [N] : h £ k(£)}, while formula (7.5) is_[k(£) : [N}*(k(£))} = N 2 . Since 
N 2 = \£[N]\ this makes the fixed field [N]*(k(£)). The condition g(X + P) = 
g(X) for all P £ £[N] means that g is in the fixed field, i.e., g = ho [N] for 
some h £ k(£). From this and from the setup of the Weil pairing, 
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div(/i o [N]) = div(ff) = ^ (Q' + S) - (S) where [JV]Q' = Q. 

se£[N] 

It follows that div(/i) = (Q) — (Of), and so Q = by Lemma 7.3.2(a), 
making the Weil pairing nondegenerate. To see that it surjects, note that 
the image e n (£ [N] , £ [N ] ) is a subgroup /x M of /x^ for some M | TV. Thus 
e N {[M]P, Q) = e N (P, Q) M = 1 for all points P, Q G £[N], forcing [M]P = 0 £ 
for all P, hence M = N. 

(d) For Galois invariance, let a G Gal(k/k). Then /q<t = and 5q<^ = gg 
(Exercise 7.4.1(b)), and so 

„ (p° n°) ^(x^ + Pn ( g{x + P) Y_ r (POY 

(e) If i : £ — ^ £' is an isomorphism then running through the definition 
shows that the function g' for e' N is g'g, = 5i-i(Q') ° Thus 

( ,, plYn » 9'n Q) (X' + i(P)) g Q (X + P) 

e N (i(P),i(Q)) = -, — — = - = e N (P,Q). 

9i(Q)i A ) 9Q( A ) 

a 

We will use the following result (Exercise 7.4.2) in the next section. As 
ever, char(k) = 0. 

Corollary 7.4.2. Suppose P, Q, P' , and Q' are points of £[N], and P' , Q' 
are linear combinations of P, Q, 

for some 7 e M 2 {Z/NZ). 
Then 

e N (P',Q') = e N (P,Q) dct \ 

In particular, if(P,Q) is an ordered basis of £[N] over Z/NZ then ejv(-P, Q) 
is a primitive Nth root of unity. 

If e' N : £[N] x £ [N] — > fj, N is any function satisfying properties (a), (b), 
and (c) of the Weil pairing then e' N = for some n G {Z/NZ)* (Exer- 
cise 7.4.3). This shows that the Weil pairing as defined on a complex torus 
C/A in Section 1.3 is a power of the algebraic Weil pairing on the elliptic 
curve y 2 = 4x 3 — g%{A)x — 33(^1). In fact they are equal, though we omit the 
exercise in complex function theory that shows this. 



~P'~ 




~p~ 


_Q'_ 


= 7 


Q. 



Exercises 

7.4.1. (a) Show that an alternating bilinear form is skew symmetric, 
(b) Let a e Gal(k/k). Explain why /q<t = /q and gQ<? = gq. 

7.4.2. Prove Corollary 7.4.2. 

7.4.3. Show that any function satisfying properties (a), (b), and (c) of the 
Weil pairing takes the form e'y for some n G (Z/NZ)*. 
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7.5 Function fields over C 

From Section 3.2, the function field of the level 1 modular curve X(\) = 
SL 2 (Z)YH* over C is generated by the modular invariant, 

C(X(l)) = A>(SL 2 (Z)) = C(j). 

This section describes the function fields for the curves X(N), Xi(N), and 
X a (N), where N > 1. The extension C(X(N))/C(X(1)) turns out to be 
Galois with group SL 2 (Z/WZ)/{±/}, and it reproduces the moduli space de- 
scription of modular curves from Chapter 1. The end of the section constructs 
a related function field extension with Galois group SL 2 (Z/7VZ). 

For each vector v E Z 2 such that 5^0, where as before v is the reduction 
of v modulo N, recall from Section 1.5 that the function 

t v, s 02 (t) fc v r + d v \ 

saM V — w — J' v = ( c v> d v) ( 7 - 7 > 

is weight-0 invariant under r(N). In fact /g G C(X(AT)). To see this we 
must show that /q is meromorphic on % and at the cusps. Meromorphy 
on % is straightforward (Exercise 7.5.1(a)). For the cusps, first verify that 
(Exercise 7.5.1(b)) 

fXM = fp(T), 7 eSL 2 (Z),reH. 

Then checking that all /q are meromorphic at oo (Exercise 7.5.1(c)) also 
checks that they are all meromorphic at the other cusps as well (Exer- 
cise 7.5.1(d)). Since the other functions from Section 1.5, 

N-l 

/oV) = / 0 (M) (r), d # 0 (mod N) and / 0 (r) = £ fjj(r), 

d=l 

are sums of special cases of /q and invariant under and r 0 (N) re- 

spectively, it follows that /| G C(Xi(iV)) and / 0 G C(X 0 (iV))- For another 
example, let jjv( T ) = j(Xr). Then j'jv € C(X 0 (A r )) by the methods of Exer- 
cise 1.2.11 (Exercise 7.5.2). These are all the examples we need. Introduce a 
little more notation for convenience, 

, _ ,±<To) , _ , _ ,±(oo) 

71,0 - Jo > 70,1 - Jl - Jo 

Proposition 7.5.1. TTie jzeMs of meromorphic functions on X(N), Xi(N), 
and X 0 (N) are 

C(X(N)) = C(j, {/±* : ±v G ((Z/-/VZ) 2 - (0, 0))/±}) 

= C(j,/i, 0 ,/o,i), 
C(Xi(JY)) = C(j, {/ 0 ±3 : ±d G (Z/NZ - 0)/±}) 

= C(i,A), 
C(X 0 (JV)) = C(j,/o)=C(j,j JV ). 



280 7 Modular Curves as Algebraic Curves 



Proof. Since p r (z) = p T (z') if and only if z = ±z' (mod A T ), each pair of 
functions /q and /J"" are equal but otherwise the /q are all distinct. Taking 
the set {f^ 7 " : ±v G ({Z/NZ) 2 - (0, 0))/±} gives the field containments 

C(X(1)) = C(j) C C(j,{f**}) C C(X(N)). 

Consider the map 

9 : SL 2 (Z) — > Aut(C(X(JV))), 
= / o 7 for / e C(X(N)), 7 G SL 2 (Z). 

This map is a homomorphism (Exercise 7.5.3(a)) and its kernel clearly con- 
tains {±I}r(N). Since any element of the kernel fixes the subficld C(j, {/ 0 "}), 
the earlier calculation /q 07 = /q 7 for 7 G SL 2 (Z) and the fact that the are 
distinct show that the kernel is also a subgroup of {±I}r(N), making them 
equal. Thus 0(SL 2 (Z)) = SL 2 (Z)/{±/}r(iV). Since 6>(SL 2 (Z)) is a group of 
automorphisms of C(X(N)) and the subfield it fixes is C(X(1)) by definition, 
the extension C(X(N))/C(X(1)) is Galois with Galois group 0(SL 2 (Z)). The 
formula /g o 7 = /g 7 shows that the intermediate field C(j, {/q }) and its 
subficld C(j, /1.0, /o,i) both have trivial fixing subgroup and thus both are all 
of C(X(N)), proving the first part of the proposition. The other two parts 
are shown by similar arguments (Exercise 7.5.3(b,c)). □ 

The function fields in the proposition satisfy the description given before 
Theorem 7.2.5 of function fields of algebraic curves over C. Thus Xi(N) is 
birationally equivalent to the plane curve defined by the complex polynomial 
<pi € C[a;,y] such that <pi(j, /1) = 0, and X 0 (N) is birationally equivalent 
to the plane curve defined by the complex polynomial ipo G C[x, y] such 
that Vo(i>/o) = 0. Section 7.7 will show that in fact the polynomials have 
rational coefficients. In connection with X 0 (N) the proposition also leads to 
the modular equation, the relation between j and jjv, which will also turn out 
to be defined over Q, 

$nUJn) = 0, $N(x,y) € Q[x,y]. 

This classical topic is discussed in [Cox97], in [Kna93], and briefly in Milne's 
online notes [Mil] , and we will not pursue it except as it bears on the material 
here. 

The proof of Proposition 7.5.1 has shown more than the proposition 
asserts. Since the field extension C(X(N))/C(X(1)) is Galois and since 
SL 2 (Z)/7^(A^) = SL 2 (Z/A r Z), the map 9 in the proof induces an isomorphism 

9- 1 : Gal(C(X(AT))/C(X(l))) SL 2 (Z/iVZ)/{±/}, 
= f o 9- 1 {a) for / e C(X(N)), a G Gal(C(X(iV))/C(X(l))), 

where 9~ 1 {a) acts on H as any of its lifts to SL 2 (Z). In particular, (fQ V ) a = 
f^ v o 9~ 1 {<t) — f^ v6 ^ so that 9~ 1 defines an action of the group 
Gal(C(X(A0)/C(X(l))) on the set {{Z/NZ) 2 - (0, 0))/±, 
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±v^ ±v0- 1 (a). (7.8) 

Exercise 7.5.3(b,c) shows that the subgroups of SL 2 (Z/7VZ) /{±/} correspond- 
ing to A (AT)) and T 0 (A^) are 

0- 1 (Gal(C(X(7V))/C(X 1 (iV)))) = {± [11] e SL 2 (Z/7VZ)/{±/}} (7.9) 

and 

0- 1 (Gal(C(X(7V))/C(X o (iV)))) = {± [g J] G SL 2 (Z/7VZ)/{±/}} . (7.10) 



To understand the results of Proposition 7.5.1 algebraically, recall yet again 
that the map (p r , p' T ) takes the complex elliptic curve C/A T to the algebraic 
curve E T : y 2 = 4x 3 — ^(t)^ — .93 ( T )- (From now on the symbol inter- 
changeably denotes an elliptic curve, its equation, or its polynomial, and the 
symbol £ will no longer be used.) Equation 7.7, defining the functions /o (t) in 
terms of p T ((c v T + d v )/N), thus combines with Proposition 7.5.1 to show that 
the meromorphic functions on X(N) are generated by the modular invariant j 
and by functions of r closely related to the x-coordinates of the nonzero N- 
torsion points on the curve E T . We now scale the curve E T to a new curve so 
that indeed C(X(N)) is generated by j and the A^-torsion x-coordinates. 

Fix any t E W. such that j(r) 4- {0j 1728}. This means that <7 2 (r) and g-$(j) 
are nonzero since j — 1728^2 /(fli ~ 27gf). Choose either complex square root 
(6'2(t)/(73(t)) 1 / 2 and consider the map 



ggj/r) 
.93 (t) 



: C//l r — -> C 2 U{oo}. 



This differs from (p T ,p' T ) by the admissible change of variable (x,y) = 
(u 2 x\u z y') where u = (g 3 (t)/^^)) 1 / 2 . Thus u~ 4 = {92^) / g^r)) 2 and 



u 



~ 6 = (<7 2 (t)/<73(t)) 3 , and nonzero points 2 + vl r map to points (x,y) sat- 
isfying the suitable modification of the cubic equation of E T according to 
Exercise 7.1.1, 

E j(T) : y 2 = 4.x 3 - - 



.3 g2(r) 3 ^ g 2 (r) 3 
5s(t) 2 3s(r) 2 ' 



Since 3 f = ( 5 3 - Zi)/27, so that g$/g 2 = 27g 3 /( 3 3 - Z\) = 27j/(j - 1728), the 
equation is defined in terms of j(r), justifying the name of the curve, 

Ej( T ) : y 2 = 4, 3 - ( ^ ) x - ( ^ ) . 
3{) y \j{T)-m&) \j(T)- 1728j 

(Cf. the method in Section 7.1 reducing the general y 2 = 4.t 3 — .g 2 x— 173 to the 
form y 2 = 4x 3 — gx — g.) The equation is independent of which square root 
(52(1") /^(t)) 1 / 2 was chosen. The map C/A T Ejr T \ restricts to an isomor- 
phism of A^-torsion subgroups. In particular it takes the canonical generators 
t/N + A T and l/N + A T of (C/A T )[N] to the points 
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"-(^^■(^H- (7U) 

Negating the square root (fl^M/saO 7 ")) 1 ^ 2 negates these points, but modulo 
this {P Tl Q T ) is a canonical ordered basis of Ejr T \[N] over Z/NZ. The x- 
coordinates of ±P T and ±Q T are /i,o(t) and /o,i(t) = /i(r) respectively, and 
more generally the nonzero points of fy^A 7 ] have x-coordinates {fo V (r)} 
as desired. The information j(r), /i,o(t), fo,i( T ) thus describes an enhanced 
elliptic curve for I^AQ modulo negation, 

(^• (T) ,±(P T ,Q T )). 

This is the sort of element that represents a point [C /7l r , (r /N+A r , 1 //V+ yl T )] 
of the moduli space S(A), excluding the finitely many such points with 
j(r) e {0, 1728}. Similarly, the information j(r), /i(r) describes (Ej^,±Q T ), 
representing a point of Si(A'), and j(r), /o(t) describes (Eji T \, {Q T }), repre- 
senting a point of So (AT). The moduli space description of modular curves is 
emerging from the function field description in Proposition 7.5.1. 

Change r to a variable so that j — j(r) varies as well. This gathers the 
family of elliptic curves £y( T ) into a single universal elliptic curve, 

whose ^-invariant is indeed the variable j (Exercise 7.5.4). The universal el- 
liptic curve specializes to a complex elliptic curve for every complex j except 
0 and 1728. To make this idea algebraic view j as a transcendental element 
over C and view Ej as an elliptic curve over the field C(j). The affine points 

2 

of Ej are ordered pairs from the algebraic closure, elements of C(j) . The 
finite A^-torsion x-coordinates of Ej are {/q V }, functions on X(N) rather 
than X(l), algebraically elements of C(j) rather than of C(j). The universal 
enhanced elliptic curves (Ej, (/^o, /o,i))i {Ej,f\), and (Ej,f 0 ), each viewed 
as a family of enhanced elliptic curves, are essentially the same thing as the 
moduli spaces S(AT), Si (AT), and So(N), and therefore as the modular curves 
X(N), Xt(N), and X 0 (N). 

An argument is required to show that {f$ v } are the universal A'-torsion 
x-coordinates. Specialize the discussion of division polynomials in Section 7.1 
to the curve Ej to show that its A^-torsion x-coordinates are characterized by 
a polynomial condition, 

4> N (g,g,x)=0, feeZ[ fl2 , S3 ,4 5 = 27j/(j- 1728). 

Let v be a nonzero element of (Z/NZ) 2 . Consider the function field element 

M9,9,fol&C(X(N)). 
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For any r £ % such that j(r) {0, 1728}, £j( r ) specializes to an elliptic curve 
over C, and g(r) and f^ v (r) specialize to complex numbers. Since f^ v {r) is 
an iV-torsion x-coordinate of £y( T ), 

^(5(r),g(r),/ 0 ± >)) = 0. 

Thus 4>n(9i 9, fo V ) is a meromorphic function on the compact Riemann sur- 
face X(N) with infinitely many zeros F(N)t, forcing it to be the zero func- 
tion. This makes f^ v the x-coordinate of a point R of the universal curve 
Ej such that [N]R — 0^.. A complementary argument using division equa- 
tions tpM where M \ N shows that the order of R is exactly N as claimed 
(Exercise 7.5.5). 

Let x(Ej[N]) denote the set of x-coordinates of nonzero points of Ej[N] 
as an alternative notation to {f^ v }- Thus x(Ej[N]) C C(j). Proposition 7.5.1 
says that adjoining fifi and / 0 ,i to C(j) adjoins all of x (E 3 \N\). That is, the 
generators t/N + A T and 1/N + A T of (C/A T )[N], where the group law is 
addition modulo A T , map to generators of x(Ej[N]), where the group law is 
defined by rational functions. (Making this statement precise requires another 
specialization argument as in the previous paragraph.) More generally, the 
first part of Proposition 7.5.1 and of the discussion after its proof rephrase as 

Proposition 7.5.2. Let Ej be the universal elliptic curve (7.12). The field of 
meromorphic functions on X(N) is the field generated by the x-coordinates of 
nonzero N -torsion points on Ej, 

C(X(N))=C(j,x(E j [N})). 

Thus the field extension C(j, x(Ej[N]))/C(j) is Galois and its group is 

Gal(Ca^[iV]))/C(. 7 )) = SL 2 (Z/7VZ)/{±/}. 

We now adjoin the y-coordinates of Ej[N] to C(j) as well, thus eliminating 
the quotient by {±7} from the Galois group. That is, let y(Ej[N]) denote the 
set of y-coordinates of nonzero iV-torsion points of Ej . Analytically these are 
the functions 




Since they are not defined on X(N) because of the square root, but rather on 
a two-sheeted surface lying over X(N), it is easiest just to think of them as 
elements of C(j). We will not use the notation 5q( t ) an Y further, but special 
cases of it are implicit in the canonical (up to negation) basis (P T , Q T ) of the 
universal TV-torsion group Ej[N], defined by (7.11) but now viewing r as a 
variable. Let C(j, Ej[N]) denote the field generated over C(j) by x(Ej[N]) U 
y{Ej[N\). Consider the field containments 



C(j) c C(j,x(Ej[N])) c C(j,Ej[N)) c C(j). 
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Corollary 7.5.3. The field extension C(j, Ej[N])/C(j) is Galois and its 
group is 

Gal(C(i,^[AT])/C(j)) - SL 2 (Z/7VZ). 

Proof. Let a : C(j, Ej[N]) — > C(j) be an embedding that fixes C(j') point- 
wise. Since the extension C(j,x(Ej[N]))/C(j) is Galois, a restricts to an 
element of Gal(C(j, a;(^[7V]))/C(j)). The set y{Ej[N]) consists of pairs of 
square roots of elements of C(j, x(Ej[N])) permuted by a, so a permutes 
y(Ej [N]) as well, making it an automorphism of C(j, Ej [N]). Thus the exten- 
sion C(j, Ej [N])/C(j) is Galois. 

Let H be its Galois group. The ordered basis (P T , Q T ) of Ej[N] over Z/NZ 
specifies an injective representation (Exercise 7.5.6) 



p : H — > GL 2 (Z/iVZ), 



Q% 



= p(a) 



Pr 
Qr 



We need to prove that p(H) = SL 2 (Z/7VZ). 

combines with Galois 



For any er G H the relation ^„ = p(a) 



p T 

Qr 



invariance of the Weil pairing and Corollary 7.4.2 to show that 

e N (Pr,Q T r = e N (P?,Q°) - e N (P T ,Q T ) dctp(CT) - 

But Cm{PttQt) is an iVth root of unity, lying in C and therefore fixed 
by a, so the displayed relation is in fact ejv(P T ,Q T ) = eff(P T , Q T ) detp ^- 
Since e^(P T1 Q T ) is a primitive Nth root by Corollary 7.4.2, det p(a) = 1 
in (Z/NZ)*. This shows that p(H) C SL 2 (Z/iVZ). 

Let K = Gal(C(j, Ej [N])/C(j, x(Ej [N]))) be the subgroup of H whose 
action on Ej[N] preserves x-coordinates. Since points with the same x- 
coordinate are equal or opposite, if a e K then P° = ±P T and Q° = ±Q T and 
p{a) £ SL 2 (Z/7VZ), so p(a) e {±7}. Conversely, if a e H and p(a) G {±7} 
then P a = ±P for all P e Ej[N], so a e K. That is, K = ,^({±7}), 
showing that \K\ < 2. By Proposition 7.5.2 Gal(C{j,x{Ej[N]))/C(j)) ^ 
SL 2 (Z/iVZ)/{±7}, and consequently |77| = |SL 2 (Z/iVZ)/{±7}| \K\. Thus 
[SL 2 (Z/7VZ) : p(H)\ < 2. 

The case [SL 2 (Z/7VZ) : p(H)} = 2 can arise only if \K\ = 1, i.e., 
-I i p(H). In this case {±7}p(77) = SL 2 (Z/iVZ) and so one of ± [? ~J] 
is in p(H). But this gives —7 = (± [" ~o]) 2 € p(H), contradiction, and so 
p(H) = SL 2 (Z/7VZ) as desired. □ 

The proof has shown that adjoining y-coordinates enlarges the Galois 
group by adjoining an clement that maps under p to —7, i.e., the clement 
negates points by negating y-coordinates, so it can be denoted —1. Modulo 
this, the representation p is the map Q~ x from before (Exercise 7.5.7). 

Figure 7.2 summarizes some of the results of this section. The displayed 
groups are the images under p of the actual Galois groups, obtained by lift- 
ing (7.9) and (7.10) to SL 2 (Z/7VZ) since p and B^ 1 are compatible. To re- 
iterate, one can think of the transcendental j as parameterizing a family of 
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elliptic curves, of the functions /i j0 and / 0 ,i = fi as iV-torsion x-coordinates 
of the points P T and Q T on the curves, and of the function / 0 as the sum of 
x-coordinates of the nonzero points of the iV-cyclic subgroup (Q T ). Thus the 
algebraic extensions of C(j) in the figure carry torsion data about the curves 
to describe moduli spaces, i.e., modular curves. At the top, y-coordinates are 
adjoined as well to eliminate a quotient by {±7} from the Galois groups. As 
the diagram shows, one can think of Xi(N) as being defined by the relation 
between j and f\ over C, and of X 0 (N) as being defined by the relation be- 
tween j and fo- To study the modular curves as algebraic curves over the 
rational numbers the underlying field needs to be changed from C to Q. This 
will be done in the next section. 

C(j,Ej[N]) 
C(X(iV))=C(i,/ 1 ,o,/ 0 ,i) 
C(X 1 (N)) = CUJi) 
C(X 0 (N)) = C(j,f 0 ) = C(j,j N ) 
C(A(1)) = C(i) 



{±[jm 



\{[o b d }} 



>SL 2 (Z/NZ) 



Figure 7.2. Fields and groups over C 



Exercises 

7.5.1. (a) Show that /q is meromorphic on %. 

(b) Show that /Jo 7 = f^P for all 7 e SL 2 (Z). 

(c) Show that /q is meromorphic at 00. More specifically, show that for 
v = (c v ,d v ) as in the section, 

Um D = /-2C(2) if c v ^ 0 (mod N), 

i m (r)^ T[ N ' 1-2((2)|^ 2 E„ eMJV) ^ ifc„ = 0(modAr). 

(A hint for this exercise is at the end of the book.) 

(d) Explain why showing that each /g is meromorphic at 00 checks the 
other cusps as well. 

7.5.2. What modifications to Exercise 1.2.11 show that j N e C(X Q (N))7 

7.5.3. (a) Show that the map 9 is a homomorphism. 

(b) Consider the set {/g : d € (Z/NZ — 0)/±} and the field containments 
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C(X(l)) c C(i,{/ 0 3 }) c C(AVAO)) C C(X(N)). 



Show that since / 0 07 = fo f° r 7 € SL2(Z), the fixing subgroup in 
SL 2 (Z)/{±/}r(A0 of the field C(j, {/£}) is {±I}r 1 (N)/{±I}r(N), and thus 
CiX^N)) = C(i, {/ 0 d }) and (7.9) holds, 
(c) Similarly, show that 



fo 0 7 = > ^ /o 



AT-l 

£ 

5=1 



«(c,d) 



for 7 = [-»] eSL 2 (Z), 



i.e., /007 is the sum of the ^-coordinates of the nonzero points of the iV-cyclic 

subgroup of £j( r ) generated by either point with x-coordinate /o C '^(t). We 
already know that / 0 e C(X 0 (N)). Use the result of Exercise 7.5.1(c) to 
show that if 7 <^ -To(^) then fo and fo o 7 take different values at the cusp 
r(N)oo oiX(N). Thus C(X 0 (N)) = C(j,f 0 ) and (7.10) holds. Also, for any 
7 e SL 2 (Z) show that Jat(7(t)) = j(~f'(NT)) where 



AT 0 
0 1 



l/N 0 
0 1 



and show that consequently C(Xo(N)) = C(j, jjv) a s well. (A hint for this 
exercise is at the end of the book.) 

7.5.4. Show that the j-invariant of the universal elliptic curve Ej is the vari- 
able j. (A hint for this exercise is at the end of the book.) 

7.5.5. Complete the specialization argument in the section by showing that 
if R is a point of the universal curve Ej with x-coordinate f^ v then the order 
of R is exactly N. 

7.5.6. Show that the map p is an injective representation. 

7.5.7. Supply details as necessary to the following proof that the representa- 
tion p is the map 9" 1 from earlier in the section modulo signs. For any nonzero 
row vector v G (Z/NZ) 2 and any a e Gal(C(j, Ej[N])/C(j)) ~ SL 2 (Z/A^Z) 
the definition of p gives 



Pr 
Qr 



) a = vp(cr) 



Pr 
Qr 



and ignoring negation give the relation on x-coordinatcs, 



;( ± v 



Pr 
Qr 



v(±*) 



x( ± vp(±a) 



Pr 
Qr 



Specifically, 

±v h-> ±wp(±cr). 

Since a pair ±er in the Galois group corresponds to a single element in the quo- 
tient group Gal(C(A(A))/C(A(l))) S SL 2 (Z/iVZ)/{±/}, this is the same as 
the earlier action of the quotient group via as shown in (7.8). 
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7.6 Function fields over Q 

Working over the complex numbers C we have considered the universal 
curve Ej and the field containments 

C(j)cC(j,Ej[N])cC{f). 

Corollary 7.5.3 established that the extension C(j, Ej[N])/C(j) is Galois with 
group SL2(Z/iVZ). This section studies the situation when the underlying field 
is changed to the rational numbers Q. The result will be that the Galois group 
enlarges to GL 2 (Z/iVZ). Large enough subgroups correspond to intermediate 
fields that are the function fields of algebraic curves over the rational numbers. 
The next section will show that the intermediate fields Q(j, /o) and Q(j, /i) 
define X 0 (N) and X^N) over Q. The field Q(j, /i,o, /o,i) defines X(N) over 
the field Q(fi N ) where fi N is the group of complex iVth roots of unity. 

Since Q is the prime subficld of C, much of the algebraic structure from 
the previous section carries over. The equation defining Ej has its coeffi- 
cients in Q(j). Viewing the curve as defined over this field means considering 

2 

points (x, y) <G Q(j) satisfying the equation. This includes the nonzero points 
of Ej[N] over C(j) from before, and in the field containments 



Q(i)cQ(i,^[Ar])cQ(i) 

the extension Q(j, Ej[N])/Q(j) is again Galois. The only difference between 
the field theory over Q and over C will involve fi N . 
Consider the Galois group 

H Q = Gal(Q(n N ,j,E j [N})/Q(j)) 

and the representation 

p:H Q ^ GL 2 (Z/iVZ) 

describing how Hq permutes Ej [N] . This is defined as before in terms of the 
ordered basis (P T ,Q T ) of Ej[N] over Z/NZ from (7.11), so that 



P? 

Qr 



= P{<?) 



Pr 
Qr 



O e Hq. 



Lemma 7.6.1. The function det p describes how Hq permutes n N , 

M ff = M dotp(ff \ Me/^v, <y^H Q . 

{Here p? is p acted on by a while ^ dct P( a ) is p raised to the power det p(cr).) 



This is shown with the Weil pairing as in the proof of Corollary 7.5.3 
(Exercise 7.6.1). To use the lemma, suppose a e Hq fixes Ej[N\. This means 



H. 
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Q(^jv) 



(z/nz)* 



> H Q -A GL 2 (Z/7VZ) 



that a G kcr(/?), so cr € ker(det p) and the lemma says that cr fixes /^jy. Thus 
jiijv C Q(j, Ej[N]) by Galois theory showing that Hq is in fact the Galois 
group of Q(j, Ej[N]) over Q(j), the analog over Q of the group H in the 
proof of Corollary 7.5.3. Consider the configuration of fields and groups in 
Figure 7.3. Since the field extension is generated by Ej[N], now p clearly is 
injcctivc, and by the lemma it restricts to 

P ■■ H QM — ► SL 2 (Z/iVZ). 

To analyze the images of Hq and i?Q( Mjv ) under p, recall a result from Galois 
theory. 

Lemma 7.6.2 (Restriction Lemma). Let k and F be extension fields of f 
inside K with F/f Galois. Suppose K = kF. Then K/k is Galois, there is a 
natural injection 

Gal(K/k) — ► Gal(F/f), 
and the image is Gal(F/(kn F)). 



K = kF 
k 



knF 

f 



Figure 7.4. Setup for the Restriction Lemma 



Proof. The situation is shown in Figure 7.4. Any map a : K — > K fixing k 
restricts to a map F — > F fixing k D F. Since the extension F/(kfl F) is 
Galois, the restriction is an automorphism of F and therefore a is an auto- 
morphism of K = kF. This shows that K/k is Galois and that restriction 
gives a homomorphism 



Gal(K/k) — -> Gal(F/(k n F)) C Gal(F/f). 
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If the restriction of some a fixes F along with k then it fixes K and is trivial, 
so the restriction map injects. Since the fixed field of K under Gal(K/k) is k, 
the fixed field of F under the restriction is k n F and so restriction maps to 
all of Gal(F/knF). □ 

One application of the lemma is implicit in Figure 7.3, where {Z/NZ)* is 
displayed as Gal(Q(/x jv , j)/Q(j)) (Exercise 7.6.2). For another, consider the 
situation shown in Figure 7.5. 



SL 2 (Z/iVZ) 



C(j,Ej[N}) 
C(j) 



c(j)nQ(. 7 ,^[iV]) 



H. 



Figure 7.5. Applying the Restriction Lemma 



The Restriction Lemma shows that SL 2 (Z/iVZ) injects into i?Q( Mjv ). But 
also p injects in the other direction, making the two groups isomorphic since 
they are finite, 

P ■■ H Q{f t N) ^ SL 2 (Z/iVZ). 

Now the lemma also shows that C(j) n Q(j, Ej[N]) — Q(/x w ,j), and inter- 
secting with Q gives 

Q{j,E j [N\)nQ = Q(jM N ). 

Also, Figure 7.3 now shows that 

\H Q \ - \H QM \ \(Z/NZ)*\ = |SL 2 (Z/ATZ)| \{Z/NZ)*\. 

But |SL 2 (Z/ATZ)| |(Z/JVZ)*j = |GL 2 (Z/iVZ)|, so the representation p sur- 
jects, 

p: H Q ^GL 2 (Z/NZ). 

This lets us specify which intermediate fields of Q(j, Ej[N])/Q(j) corre- 
spond to algebraic curves over Q. Let K be an intermediate field and let the 
corresponding subgroup of Hq be K = Gal(Q(j, Ej [N])/K), as in Figure 7.6. 

Recall that det p describes how Hq permutes n N . This gives the equiva- 
lences 

k n Q = Q k n Q(/xjv) = Q 

dctp : K — > (Z/NZ)* surjects. 
Summing up the results of this section, 
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Q(j,Ej[N]) 



K • 

Q(j) 



K 



Hq ^ GL 2 (Z/7VZ) 



Figure 7.6. Subgroup and fixed field 



Theorem 7.6.3. Let Hq denote the Galois group of the field extension 
Q(j,Ej[N])/Q(j). There is an isomorphism 

p :H Q ^GL 2 (Z/7VZ). 

Let K be an intermediate field and let K be the corresponding subgroup of Hq. 
Then 

K n Q = Q det p : K — > (Z/NZ)* surjects. 

Thus K is the function field of an algebraic curve over Q if and only if det p 
surjects. 

The last statement in the theorem follows from Theorem 7.2.5. 
Exercises 

7.6.1. Prove Lemma 7.6.1. 

7.6.2. Justify the relation Gal(Q(/x w , j)/Q(j)) ~ (Z/NZ)* shown in Fig- 
ure 7.3. 



7.7 Modular curves as algebraic curves and Modularity 

This section defines the modular curves Xo(N) and Xi(N) as algebraic curves 
over Q and then restates the Modularity Theorem algebraically. 

Consider three intermediate fields of the extension Q(j, Ej[N])/Q(j), 

K 0 = Q(j,/o), K' 0 = Q(j,j N ), Ki = Q(j, /i), 

analogous to the function fields C(j, /o) = C(j, j^v) and C(j, /i) of the modu- 
lar curves X 0 (N) and Xi(N) as complex algebraic curves. The subgroups K 0 , 
Kq, and K\ of Hq corresponding to K 0 , Kq, and Ki satisfy (Exercise 7.7.1) 



p(K 0 ) = p(K' 0 ) 



a b 
0 d 



p(Ki) - ± 



a b 
0 1 



(7.13) 



running through all such matrices in GL/2(Z/iVZ), so in fact Ko = Kq. Thus 
detp : Kj — > (Z/NZ)* surjects for j = 0,1, and so by Theorem 7.6.3 the 
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fields K 0 and Ki are the function fields of nonsingular projective algebraic 
curves over Q. Denote these curves X 0 (N) l!l \ g and X 1 (N) a \ g . 

To relate the algebraic curve Xi(N) a \ g defined over Q to the complex mod- 
ular curve Xi(N) viewed as an algebraic curve over C, we need the following 
theorem from algebraic geometry. 

Theorem 7.7.1. Let k be a field, let C be a nonsingular projective algebraic 
curve over k defined by polynomials ipi, . . . , ip m G k[a;i, . . . , x n ], and let the 
function field of C be k(C) = k(t)[u]/(p(it)). Let K be a field containing k. 
Then the polynomials ipi, . . . , tp m , viewed as elements of~K\x\, . . . , x n ], define 
a nonsingular algebraic curve C over K, and its function field is K(C") = 
K(t)[u]/(p(u)). ' 

For example, any nonsingular Weierstrass equation over Q defines an ellip- 
tic curve over C, as we earlier observed directly. In the theorem 
understood to be indctcrminants over K as well as over k. The theorem says 
that after enlarging the field of definition from k to K, the ideal generated by 
the ipi remains prime and the nonsingularity condition continues to hold at 
each point of the curve over the larger field. The form of K(C") shows that 
the minimal polynomial p of u over k(i) is irreducible over K(i), making it 
the minimal polynomial of u over K(t) as well. 

To apply the theorem to modular curves, let k = Q and C = Xi(N) a \ g , 
and let p\ G Q(j)[a;] be the minimal polynomial of fi over Q(j), so that 

Q(C) = QC/,/i) = Q(j)N/<pi). 

Let K = C. Since this field is algebraically closed, the theorem says that the 
C-points (points with coordinates in C) that satisfy the polynomials defining 
Xi(N) a i g over Q form a curve C = Xi(N) a i g ^c over C with function field 

C(X 1 (NU SiC ) = C(j)[x}/( Pl ), 

and pi is also the minimal polynomial of /i over C(j) (see Exercise 7.7.2 for 
an elementary proof of this last fact). Since the minimal polynomials are the 
same, the function field C(j, /i) of the Ricmann surface Xi(N) — or of the 
corresponding nonsingular algebraic curve over C, also denoted Xi(N) — is 

C(X 1 (N)) = C(j,f 1 )=C(j)[x]/{p 1 ), 

and since the two function fields agree Theorem 7.2.5 says that Xi(N) — 
Xi(N) a i gt c up to isomorphism over C as desired. The argument for X 0 (N) 
is virtually identical, using either f 0 or jjy in place of f\. For the algebraic 
model of X(N) see Exercise 7.7.3. 

The planar model of X 1 (A f ) alg is defined by the polynomial p\. Clear its 
denominator to get a polynomial <p\ G Q[j, x\. Then the points of the planar 
model are 

^i(A0aig nar = {{3,x) G Q 2 : <Pl(j' X ) = °}- 
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As discussed after Theorem 7.2.5 there is a birational equivalence over Q 
between the planar model Xi(N)^ nar and the actual nonsingular projective 
curve Xi(N) a i g itself. As a set map this takes all but finitely many points 
of the planar model to all but finitely many points of the curve. Since the 
description of Xi(N) a i g is abstract, we will sometimes use its planar model in 
its stead for computations. 

The values j(a 3 ) = 0 an d = 1728 show that the elliptic points of 
the Riemann surface X± (N) correspond to points (j, f\ ) with rational first 
coordinate and thus algebraic coordinates in the planar model of X\(N)gx s ,c- 
That is, no elliptic points are lost in the transition from X\(N) to Xi{N) a \ g . 
The same is true for cusps, viewing j{oo) = oo — [1,0] as a rational point 
of the projective plane. This argument holds for a general modular curve, 
regardless of whether it is defined over Q or some larger number field K C Q. 

Version Xc of the Modularity Theorem, from the end of Chapter 2, has a 
variant in the language of algebraic geometry. 

Theorem 7.7.2 (Modularity Theorem, Version Xq). Let E be an ellip- 
tic curve over Q. Then for some positive integer N there exists a surjective 
morphism over Q of curves over Q from the modular curve A" 0 (A r ) alg to the 
elliptic curve E, 

AVAOaig — ► E. 

By contrast, Version Xc assumes a complex elliptic curve E with rational 
j-invariant and guarantees a holomorphic map X 0 (N) — > E of Riemann 
surfaces. 

Version Xq easily implies Version Xc- Any complex elliptic curve E 
with rational j-invariant, viewed as an algebraic curve over C, is isomor- 
phic over C to the C-points E'(C) of an elliptic curve E' over Q. Ver- 
sion Xq gives a map X 0 (N) a \ g — > E' over Q and this extends to C-points, 
^o(A0ai g (C) — ► E'(C). A map X 0 (N) — > E follows immediately and can 
be viewed as a holomorphic map of Riemann surfaces as desired. On the other 
hand, showing that Version Xc implies Version Xq, i.e., transferring the the- 
orem from complex analysis to algebraic geometry and refining the field of 
definition from C to Q, is beyond the scope of this text. The argument is 
given in the appendix to [Maz91], and it involves replacing the N in Ver- 
sion Xc by a possibly different N in the resulting Version Xq. In any case, 
the approach to proving Modularity developed by Wiles ultimately gives the 
a priori stronger Version Xq . 

Definition 7.7.3. Let E be an elliptic curve over Q. The smallest N that 
can occur in Version Xq of the Modularity Theorem is called the analytic 
conductor of E. 

Thus the analytic conductor is well defined on isomorphism classes over Q 
of elliptic curves over Q. The next section will show that isogeny over Q 
of elliptic curves over Q is an equivalence relation, so in fact the analytic 
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conductor is well denned on isogeny classes of elliptic curves over Q. We will 
say more about the analytic conductor in Chapter 8. 

Similar algebraic refinements of Versions Jc and Ac of the Modularity 
Theorem from Chapter 6 require the notion of a variety, the higher-dimen- 
sional analog of an algebraic curve. As in Section 7.2, let m and n be positive 
integers, consider a set of m polynomials over Q in n variables, and suppose 
that the ideal they generate in the ring of polynomials over Q, 

I = (fl, ■ • - ,</?m) C Q[.Ti, . . .,X n ], 

is prime. Then the set V of simultaneous solutions of the ideal, 
V = {P E Q™ : (fi(P) = 0 for all <p e /}, 

is a variety over Q. The coordinate ring of V over Q is the integral domain 

Q[V] = Q[ Xl ,...,x n ]/I, 

and the function field of V over Q is the quotient field of the coordinate ring, 

Q(V) = {F=f/g:f,geQ[V], g^O}. 

The dimension of V is the transcendence degree of Q(V) over Q, i.e., the 
number of algebraically independent transcendcntals in Q{V), the fewest tran- 
sccndentals one can adjoin to Q so that the remaining extension up to Q(V) 
is algebraic. In particular a curve is a 1-dimensional variety. If V is a variety 
over Q and for each point P e V the m-by-n derivative matrix [Djifi(P)] 
has rank n — dim(V) then V is nonsingular. As with curves, homogenizing 
the defining polynomials leads to a homogeneous version Vh om of the vari- 
ety, and the definition of nonsingularity extends to Vhom- Morphisms between 
nonsingular projective varieties are defined as between curves. 

The Jacobians from Chapter 6 have algebraic descriptions as nonsingular 
projective varieties, defined over Q since modular curves are defined over Q, 
and having the same dimensions algebraically as they do as complex tori. 
We will sec in Section 7.9 that the Hecke operators are defined over Q as 
well. Consequently the construction of Abelian varieties in Definition 6.6.3 as 
quotients of Jacobians, defined by the Hecke operators, also gives varieties 
defined over Q, and again the two notions of dimension agree. Versions Jc 
and Ac of the Modularity Theorem modify accordingly. 

Theorem 7.7.4 (Modularity Theorem, Version Jq). Let E be an ellip- 
tic curve over Q. Then for some positive integer N there exists a surjective 
morphism over Q of varieties over Q 



Jo(A0ai g — > E. 
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Theorem 7.7.5 (Modularity Theorem, Version Aq). Let E be an elliptic 
curve over Q. Then for some positive integer N and some newform f G 
S2(ro(N)) there exists a surjective morphism over Q of varieties over Q 



Our policy will be to give statements of Modularity involving varieties but, 
since we have not studied varieties, to argue using only curves. In particular, 
we never use the variety structure of the Abelian variety of a modular form. We 
mention in passing that rational maps between nonsingular projective varieties 
are defined as for curves, but a rational map between nonsingular projective 
varieties need not be a morphism; this is particular to curves. The Varieties- 
Fields Correspondence involves rational maps rather than morphisms, and the 
rational maps are dominant rather than surjective, where dominant means 
mapping to all of the codomain except possibly a proper subvariety. 

Applications of Modularity to number theory typically rely on the alge- 
braic versions of the Modularity Theorem given in this section. A striking ex- 
ample is the construction of rational points on elliptic curves, meaning points 
whose coordinates are rational. The key idea is that a natural construction of 
Heegner points on modular curves, points with algebraic coordinates, follows 
from the moduli space point of view. Taking the images of these points on 
elliptic curves under the map of Version Xq and then symmetrizing under 
conjugation gives points with rational coordinates. As mentioned early in this 
chapter, the Mordcll Weil Theorem states that the group of rational points 
of an elliptic curve over Q takes the form T © Z r where T is the torsion sub- 
group, the group of points of finite order, and r > 0 is the rank. The Birch and 
Swinnerton-Dyer Conjecture, to be discussed at the end of Chapter 8, pro- 
vides a formula for r. Gross and Zagier showed that when the conjectured r 
is 1 , the Heegner point construction in fact yields points of infinite order. See 
Henri Darmon's article in [CSS97] for more on this subject. 

Exercises 

7.7.1. Show that the images of K 0 , K' Q , and K\ under p are given by (7.13). 
(A hint for this exercise is at the end of the book.) 

7.7.2. Let p\ t c G C(j)[x] be the minimal polynomial of f\ over C(j) and let 
Pi,Q € Q(j)M t" c th° minimal polynomial of f\ over Q(j), a multiple of pi.c 
in C(j)[a;]. Let e^v be 2 if N <= {1,2} and 1 if N > 2. Show that by Galois 
theory and Figure 7.2, 



and show similarly that this is also deg^^cj). Thus the polynomials are equal 
since both are monic. 




deg(pi, c ) - e N \SL 2 (Z/NZ)\/(2N), 
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7.7.3. The relevant field for the algebraic model of X(N) is 

K = Q(i,/i, 0 ,/o,i), 

analogous to the function field C(j, /^o, /o,i) of X(N) as a complex algebraic 
curve. Show that the corresponding subgroup K of Hq satisfies p(K) = {±J}, 
so K n Q = Q(fi N ). The algebraic model of the modular curve for r(N) is 
the curve X(N) a \ g over Q(fi N ) with function field Q(fj, N )(X(N) a i g ) = K. 
Connect this to the complex modular curve X(N). 

7.8 Isogenies algebraically 

Let N be a positive integer. Recall the moduli space Si(iV) from Section 1.5, 

Si (AT) = {equivalence classes [E, Q] of enhanced elliptic curves for ri(AT)}, 

and recall the moduli space interpretation of the Hecke operator T p on the 
divisor group of Si(iV) from Section 5.2, 

T P [E,Q]=J2[E/C,Q + C], 

c 

where the sum is taken over all order p subgroups C of E with C n (Q) — {0}. 
So far we understand the quotient E/C only when E is a complex torus. This 
section uses both parts of the Curves-Fields Correspondence to construct the 
quotient when E is an algebraic elliptic curve instead. Thus T p can be viewed 
in algebraic terms. The algebraic formulation of T p will be elaborated in the 
next section and used in Chapter 8. 

Section 1.3 defined an isogeny of complex tori as a nonzero holomorphic 
homomorphism, which thus surjects and has finite kernel. Any such isogeny 
is the quotient map by a finite subgroup followed by an isomorphism. Every 
isogeny tp of complex tori has a dual isogeny tp in the other direction such 
that tp o tp — [N] where [N] = deg(tp) = | ker(p)\. Similarly, define an isogeny 
between algebraic elliptic curves to be a nonzero holomorphic morphism. 

For an algebraic construction of the quotient isogeny, let k be a field of 
characteristic 0 and let E be an elliptic curve over k. Let C be any finite 
subgroup of E, so that C C E[N] for some N, and let 1 = k(E[N}) be 
the field generated over k by the coordinates of the nonzero points of E[N]. 
According to Theorem 7.7.1 we may regard E as an elliptic curve over 1. When 
k is algebraically closed, e.g., k = C or k = Q, then 1 = k and there is no 
need to introduce 1 at all, but we cannot always assume this. 

For each point P e C there is a corresponding automorphism r|> of the 
function field 1(E), 

rpf -/orp for all fel(E). 

Since Tp + p> = t p o t p ,, the group C naturally identifies with a subgroup 
of Aut(l(£)), 
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C^{ T * P :Pe C}. 

Let \{E) C denote the corresponding fixed subfield of 1(E). This is a function 
field over 1, so by Theorem 7.2.5 there exists a curve E' over 1 and an 1-iso- 
morphism i : l(E') — > \(E) C . View this as an 1-injection, 

i : l(E') — -> 1(E). 

By Theorem 7.2.6 there is a corresponding surjective morphism over 1, 

ip : E — > E', ip* =i. 

This will turn out to be the desired quotient isogeny. 

For any point X e E let Y = <p(X) e E' and consider the set tp^ 1 (Y) C E. 
For any P e C and any / e \(E) C we have f(X + P) = (rpf)(X) = f(X), 
so that the construction of the quotient map given at the end of Section 7.2 
shows that ip(X + P) = (p(X) = Y. That is, X + C C (f^ 1 (Y). But also, 

\<P~HY)\ < Ex>e v -HYfx'(^) = deg(^) by (7.4) 

= [1(E) : i(l(E'))] since i = ip* 

= [1(E) : l(E) c ] = \C\ by Galois theory, 

so that in fact p^ x (Y) = X + C. In the process this has shown that ip is 
unramified and deg(ip) ~ \C\. 

The image curve E' = <p(E) is again an elliptic curve with 0 E > = p(0e)- 
Indeed, algebraic versions of the genus and the Riemann-Hurwitz formula 
from Chapter 3 exist, along with a more general notion of an elliptic curve as 
an algebraic curve of genus 1 with a distinguished point designated as its 0. 
An argument using the algebraic Riemann-Roch Theorem shows that any 
such curve is isomorphic to a curve in Weierstrass form. (See Exercise 7.8.1 
for an example and then a sketch of the proof. Explaining all of this would 
lead us too far afield, but the interested reader should see [Sil86].) Since tp 
is unramified, it follows that (E' 1 p(0 E )) is an elliptic curve as claimed. This 
makes ip a homomorphism as observed after Theorem 7.3.3, and its kernel is 

ker(< / ?) = (^- 1 ((4)=0 s + C = C. 

Thus we can view E' and tp as E/C and the quotient isogeny, transferred from 
the complex analytic setting of Chapter 1 to morphisms of algebraic elliptic 
curves over 1, 

E^E/C, Q^Q + C. 

Exercise 7.8.2 shows that in fact the constructions of E/C and <p are de- 
fined over a field that is generally smaller than 1. In the next section Exer- 
cise 7.9.2 will construct a rational function over Q of two variables such 
that j(E/C) = r^(j(E),x(C)) where E is an elliptic curve over Q, C is a 
cyclic subgroup of order N, and x(C) is the sum of the x-coordinates of the 
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nonzero points of C. This exercise works with a field of definition that is not 
algebraically closed. 

As for the dual isogeny, 

Theorem 7.8.1. Letk be afield. Isogeny over It between elliptic curves over la 
is an equivalence relation. That is, if tp : E — > E' is an isogeny over k of 
elliptic curves over k then there exists a dual isogeny tp : E' — > E, also 
defined over k. 

Proof. Given tp : E — > E', let C — ker (tp), let N — \C\, and consider 
also [TV] : E — > E, again an isogeny over k. Similarly to before, identify 
each point P e E[N] with the automorphism r|> of the function field k(E), 
and thus identify E[N] with a subgroup of Aut(k(_E)). Since ker(y) = C, 
the embedded image p*(k(E')) in k(E) is the subficld of functions that don't 
separate points of C, i.e., it is k(E) c . Similarly [N]*(k(E)) = k(E) E l N l Thus 
in the following configuration the horizontal arrows are k-isomorphisms: 

HE) 



k{E')-^k{E) c 

k(E)^^k(E) E W. 
By Theorem 7.2.6 the k-injection 

HE)^r^k(E') 

takes the form tp* where ip : E' — > E is a morphism over k. And tp* o tp* = 
[N]* : k(E) — > k(E), so tp o tp = [N]. That is, tp is the desired dual isogeny. 

□ 

Exercise 7.8.3 gives an example of an isogeny and its dual. Exercise 7.8.4 
is to show that also tp o tp = [deg(y>)]. Exercise 7.8.5 is to show that tp* = tp* 
as maps from Pic°(£') to Pic°(£;). 

The more general notion of elliptic curve that this section has mentioned 
gives rise to our first examples of the Modularity Theorem. Since the modular 
curves X 0 (N) a i g are defined over Q, those that have genus 1 are modular 
elliptic curves if they contain a point with rational coordinates. We show 
inductively that this is the case for all N. Recall that X 0 (N) a \ g is a subset 
of ^o(A0ai g ,C) which is identified with the Riemann surface X 0 (N). Similarly 
to the argument in Section 7.7, Xo(N) a \ g retains the cusps of the Riemann 
surface. Specifically we show that the point of X 0 (N) a i g corresponding to the 
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cusp r o (N)(0) is rational. Since the map j : X 0 (l) — > A" 0 (l) a ig,c = P 1 (C) 
takes SL 2 (Z)(0) to the rational point oo — [1,0], this holds for N — 1. Now 
suppose it holds for some N and let p be prime. Let P e X n (Np) a \ g be the 
point corresponding to r o (Np)(0). The projection map tt : X 0 (Np) a i g — > 
Xo(A r ) a ig is a morphism over Q of curves over Q. Its degree is p or p + 1 
according to whether p \ N. The point Q — n(P) e Xo(N) a i g corresponds 
to Jo(^V)(0), and working complex analytically shows that the ramification 
degree ep(ir) is p, cf. Section 3.1. Consider the divisor D = (Q) on X 0 (N) a \ g . 
By induction Q has rational coordinates, and D is therefore invariant under 
the automorphisms of Q. Its pullback ir*D is of the form p(P) or p(P) + (P') 
for some P' ^ P according to whether p\ N. For any automorphism a of Q, 

{TT*D) a = i:*{D a ) = n*(D), 

showing that P has rational coordinates as well. This completes the induction. 

Thus if A"o(AQ a ig has genus 1 and / is the unique newform of ^(-To^)) 
then there are isomorphisms of elliptic curves over Q, 

AVAOaig — ► J O (A0a lg — »• 4,alg- 

An algorithm in [Cre97] gives an equation for A'j and therefore for X 0 (iV) a ig. 
In particular, Exercise 3.1.4(e) shows that Ao(p) a ig is a modular elliptic curve 
for p = 11, 17, 19. The Weierstrass equations produced by the algorithm are 
in a generalized form to be mentioned in Exercise 7.8.1 and explained at the 
beginning of the next chapter, 

Ayil)ai g : y 2 + y - x 3 - x 2 - 10a; - 20, 
*o(17)aig : y 2 + xy + y = x 3 - x 2 - x - 14, 
Ayi9)ai g : y 2 + y = x 3 + x 2 - 9x - 15. 

More generally, for any weight 2 newform / € <S 2 (-fo(-^0) with rational coef- 
ficients the associated Abelian variety A'j alg is a modular elliptic curve and 
the algorithm in [Cre97] gives an equation for it. 

Exercises 

7.8.1. (a) Consider the curve C : x 3 + y 3 = 1 and its point P = (0,1). 
Homogenize to C : x 3 + y 3 — z 3 and P — [0, 1, 1]. Show that replacing z by 
y+z/3 and then dehomogcnizing back to z = 1 puts the curve into Weierstrass 
form E and takes P to 0e- 

(b) Let E' be a nonsingular algebraic curve over k of genus 1 with a 
distinguished point P. Let t be a uniformizer at P. Use Corollary 3.4.2(d) 
of the Riemann-Roch Theorem, now interpreted as a theorem about alge- 
braic curves, to show that £([d]P) = d for all d € Z + . Use this to show that 
L(P) = k; that L([2]P) has basis {1,X}, where X has a double pole at P; 
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that L([3]P) has basis {1, X, Y}, where Y has a triple pole at P; that L([4]P) 
has basis {1,X,Y,X 2 }; that L([5]P) has basis {1, X, Y, X 2 , XY}; and that 
L([6]P) contains {1, X, Y, X 2 ,XY, X 3 , Y 2 }. Since this last set has seven el- 
ements it is linearly dependent. Explain why any linear relation on this set 
necessarily involves both X 3 and Y 2 , so that after rescaling over k it takes 
the form 



We will see in the next chapter that easy changes of variable reduce this 
relation to (Y') 2 = A(X') 3 - g 2 X' - g 3 . Thus (E',P) has a planar model in 
Weierstrass form. (A hint for this exercise is at the end of the book.) 

7.8.2. Let E be an elliptic curve over k and let C be a finite subgroup of E, 
so C C E[N] for some N. Let 1 = k(E[N]) and let G = Gal(l/k). Let H be 
the subgroup of G that fixes C as a set, 



and let k' be the corresponding fixed subfield of 1. (See Figure 7.7.) The 
quotient curve E/C and the quotient isogeny ip have been constructed over 1. 
This exercise shows that in fact they can be defined over k'. 



(a) Regarding E as an elliptic curve over 1, consider the corresponding 
function field 1(E). The group G acts on 1(E) from the left via its action 
on 1. Notate the action of a e G as / i-> °f. The group E[N] acts on 1(E) 
via translation, the action of each point P being / i->- Tpf. The semidirect 
product group H k C makes sense since H acts on C. Check that this group 
acts on 1(E) as 



Y 2 + mXY + a 3 Y = X 3 + a 2 X 2 + a 4 X + a 6 . 



H = {aeG: a(C) = C} 




Figure 7.7. Fields and groups for the quotient curve 
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(b) Let K be the subficld of 1(E) fixed by H k C. Show that K n 1 = k', 
and explain how this shows that K is the function field of a curve E' defined 
over k'. We don't yet know that E' is an elliptic curve. 

(c) Use Theorem 7.7.1 to show that l(E') is the subfield of 1(E) generated 
by 1 and K. Show that the field so generated is also the subficld of 1(E) fixed 
by C, making it the function field \(E/C) by the construction of quotient 
isogeny in the section. Therefore E' is isomorphic over 1 to E/C as desired. 

(d) Show that the 1-injection i : l(E') — > 1(S) from the section restricts 
to a k'-injection 

i :k'(E') —+k'(E), 

giving a morphism ip : E — > E' defined over k'. In particular <p(0e) € E'(k'), 
and along with (c) this shows that E' is an elliptic curve over k'. 

(e) Over what subfield of Q(j,Ej[N]) does this exercise show that the 
quotient isogeny Ej — > Ej/(Q T ) is defined, where Q T is the TV-torsion point 
as in Section 7.5? 

7.8.3. Consider the elliptic curves 

E : y 2 = Ax 3 - Ax, E' : f = Ax 3 + x. 

Let 

/ y 2 y(x 2 + can,. 

{m?>-&-) ={x ' y) - 

Show that (p takes E to E' and <p(0e) = 0e'- Compute ker(ip). Let 

f_ y(4x 2 -l) \ 
4x 2 ' Ax 2 ) ' 

Confirm that ip takes E' to E and that ij) o ip = [deg(y)] on E. 

7.8.4. Let cp : E — > E' be an isogeny and let ip be its dual isogeny. Show 
that also tpotp = [deg(<^)]. (A hint for this exercise is at the end of the book.) 

7.8.5. Let ip : E — > E' be an isogeny and let ip be its dual isogeny. Show 
that ^* = <P* a s maps from Pic (E') to Pic (E). (A hint for this exercise is 
at the end of the book.) 

7.9 Hecke operators algebraically 

The modular curve for A(^0 is now algebraic over Q. This section shows 
that the Hecke operators (d) and T p are defined over Q as well, obtaining an 
algebraic version over Q of a commutative diagram involving complex analytic 
objects, 
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Div(Si(JV)) — Z ^Div(S 1 (7V)) 



Div(Xi(JV)) — ^->Div(Xi(i\0). 



(7.14) 



This is diagram (5.8) slightly modified by changing ii(iV) to Xi(N) in the 
bottom row. The diagram shows that T p is defined compatibly on the complex 
analytic moduli space and modular curve for ri(N). The moduli space Si (TV) 
is defined in Section 1.5. The map ipi down the sides comes from the bijection 
ipi : Si(N) — > Yi(N) of Theorem 1.5.1, but it no longer surjects because 
Xi(N) also includes the cusps. The divisor group interpretations of T p are 
given in Section 5.2. This section also gives the similar algebraic diagram 
over Q with (d) in place of T p . 

To make the top row of diagram (7.14) algebraic, first continue to work 
over C. An enhanced complex algebraic elliptic curve for J\(iV) is an ordered 
pair (E, Q) where E is an algebraic elliptic curve over C and Q <G E is a point 
of order N, and two such pairs (E,Q) and (E',Q') are equivalent if some 
isomorphism E —} E' defined over C takes Q to Q'. The complex algebraic 
moduli space for J\(AT) is the set of equivalence classes 

Si(A r ) a i gi c = {enhanced complex algebraic elliptic curves for ri(iV)}/ <~, 

and its points are denoted [E, Q]. The complex analytic moduli space Si(iV) 
in (7.14) and the complex algebraic moduli space Si(7V) a i gi c are in bijective 
correspondence. Specifically, let E T be the curve y 2 = 4a; 3 — g2{T~)x — gz{r) 
for t e H, as in Section 7.5. Then the bijection from Si (AT) to Si(-/V) alg , c is 

[C/Ar, 1/N + A T ] ^ [E T , (p T (l/N), p' T (l/N))]. (7.15) 

Since the moduli spaces biject, the formula defining T p on the top row of (7.14) 
transfers to Div(Si(A r ) a i g>c ), 

T P [E,Q]=J2[E/C,Q + C}. (7.16) 

c 

This makes sense in the algebraic environment because the quotient isogeny 
construction of the previous section applies to curves over C. As always, the 
sum is taken over all order p subgroups C C E such that C fl (Q) — {0}, hence 
over all order p subgroups when p \ N. 

Now work over Q rather than C. An enhanced algebraic elliptic curve 
for r±(N) is an ordered pair (E,Q) where E is an algebraic elliptic curve 
over Q and Q € E is a point of order N, and two such pairs (E, Q) and 
(E' , Q') are equivalent if some isomorphism E ^> E' defined over Q takes Q 
to Q' . The algebraic moduli space for Zi(AT) is the set of equivalence classes 



Si(A r ) a ig = {enhanced algebraic elliptic curves for ri(N)}/ <~, 
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and its points are denoted [E,Q]. The intersection of an equivalence class in 
Si(iV)aig,c with Si(iV) a i g is an equivalence class in Si(_/V) alg (Exercise 7.9.1). 
Thus Si(iV)aig can be viewed as a subset of Si(AT) a i g] c- By Theorem 7.1.3 
torsion is retained from C to Q, so if E is defined over Q then its order p 
subgroups CC-E such that C n (Q) — {0} are the same as such subgroups of 
the corresponding complex curve Eq,. Also, the quotient isogeny construction 
restricts to curves over Q. Thus formula (7.16) restricts to Div(Si(-/V) a i g ), and 
this restriction is the desired algebraic version over Q of the top row of (7.14). 

To make the bottom row of diagram (7.14) algebraic, identify the Ricmann 
surface X\{N) with X\(N) a \^c, the complex points of the algebraic model 
over Q. The bottom row becomes 

T p : Div(X 1 (JV) algiC ) — ► Div(X 1 (JV) a i g , c ). 

We will see that this map is defined over Q, and so it restricts to Q-points, 

T p : Div(Ax(iV) alg ) — ► Div(Xi(AT) a i g ). 

This restriction is the desired algebraic version over Q of the bottom row 
of (7.14). 

The fact that T p on Div(A"i(A r ) a i gi c) is defined over Q is shown using the 
Curves-Fields Correspondence and Galois theory. For simplicity first consider 
the other Hecke operator (d) . The corresponding field map is its pullback (d) * , 
so to show that (d) is defined over Q it suffices to show that (d)* takes gen- 
erators of Q(Xi(Af) alg ) back to Q(X!(A0 alg ). Since Q(Xi(JV)aig) = Q(j,/i), 
the definition (d)(ri(JV)r) = A^hO) where 7 = [ a c b s ] e r 0 (N) with 
5 = d (mod N) shows that this means checking j(j(r)) and /i(7(t)) for 
such 7. Because ,7(7(1")) = j(r), only fi(j(r)) is in question. But as in Sec- 
tion 7.5, 

A(7(r)) ^ f^(l(r)) = f^(r) = / 0 «(r). 

Thus /i(7(t)) is the ^-coordinate of ±[d]Q T € making it an element 

of Q(j(T),E j(r) [N]). By definition Q(Xi(iV) alg ) is the fixed field of the Ga- 
lois subgroup K x such that p(K x ) = {±[g \}} C GL 2 (Z/NZ). This group 
preserves ±[d]Q T and therefore fixes /i(7(r)) as desired, and thus (d) is de- 
fined over Q. 

Recall the planar model Xi(N)^ na,r from Section 7.7, the ordered pairs 
(j, x) satisfying the polynomial relation between j( T ) an d /i(t)- In terms of 
this model, the Hecke operator is 

(d)(J,x) = (j,r d (j,x)) 

where is the rational function over Q of two variables such that x([d]Q T ) = 
r d{j(T~),x(Q T )). To provide the reader another opportunity to use the ideas 
of these last two paragraphs in an easy setting, Exercise 7.9.2 shows that 
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the operator wn on the modular curve X 0 (N) (cf. Exercise 1.5.4) is defined 
over Q. Then the more elaborate Exercise 7.9.3 shows that T p is also defined 
over Q as claimed. 

The sides of diagram (7.14) extend the map of Theorem 1.5.1, 

: Si(JV) — > X,(N), [C/A T , 1/N + A T ] ^ A(AT)r, 

to divisor groups. To make Vi algebraic, consider the commutative diagram 

Si(JV) ^Si(l) 

ipi 

Xx(N) >X 1 (1) 

given by 

[C/Ar, 1/N + A T ] I >[C/4 T ] 

A(iV)T I > SL 2 (Z)r. 

Recall that the complex analytic: moduli space Si(iV) identifies with the com- 
plex algebraic moduli space Si(-ZV) a ig,c, an d the Riemann surface Xi(N) iden- 
tifies with the complex points ^i(-/V) a ig,c of the algebraic modular curve. 
When N = 1, the complex algebraic moduli space is the equivalence classes 
of complex algebraic elliptic curves, and the complex points of the algebraic 
modular curve are the complex projective line P 1 (C). The diagram becomes 

-> Sl(l)alg,C 

(7.17) 

> ^l(l)alg,C 

given by 

[E,Q]\ >[E] 



p\ >j(E). 

Since our description of Xi(7V) a i g is abstract, we cannot write the maps 
down the left side and across the bottom row explicitly. Nonetheless, since 
an element [E,Q] of Si(7V) a i gi c describes an element of the algebraic moduli 
space Si(7V) a ig if and only if j(E) £ Q, as discussed earlier in the section, map- 
ping down and then across in diagram (7.17) takes Si(iV) alg to the algebraic 
modular curve -Xi(l) a i g = P : (Q). It follows that the point P in the diagram 



Si(AO a i g ,c 

*/>l,alg 

^iWalg.C 
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lies in the algebraic modular curve Xi(N) a \ g (Exercise 7.9.4(a)). That is, the 
left side of diagram (7. 17) restricts to the desired algebraic map 

V-i.aig : Si(iV) a i g — >Xi(JV) alg . 
We now have the algebraic counterpart over Q to diagram (7.14), 

Div(Si(AOai g ) — ^->Div(Si(J\Oaig) 



V'l.alg 



V'l.alg 



Div(X 1 (7V) alg ) — ^Div(Xi(J\Oalg)- 

The vertical maps restrict to degree-zero divisors and then the bottom row 
passes to Picard groups, giving a diagram we will need in Chapter 8, 



Div^SrtAOaig) Div^S^AOaig) 



*l,a 



(7.18) 



Pic 0 (X!(iV) alg ) Pic°(X 1 (N) &lg ). 
Similarly, starting from diagram (5.9), 



Si (TV) 



(d) 



>Si(JV) 



■01 



(d) 



jives another algebraic diagram we will need in Chapter 8, 
Div 0 (S!(iV) alg ) -^-> Div^S^AOaig) 



^l.alg 



Pic°(XiWaig) -^Pic^X^N)^). 



l/'l.alg 



(7.19) 



The map V'l.aig can be described more explicitly at the cost of losing some 
points along the way. Note that j(r) ^ {0, 1728} for all but finitely many 
points [C/A T , l/N + A T ] of the complex analytic moduli space S\(N), so that 
as in Section 7.5 an admissible change of variable takes E T to 



E j{r) : y 2 = 4x 3 



27j(^) 
j(t) - 1728 



27j(r) 
j(t) - 1728 



and takes (p T (l/N), p' T (l/N)) to Q T . That is, the bijection (7.15) from Si(iV) 
to Si(iV) a i g; c restricts to 
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[C/A T ,1/N + A T ] i ^ [^ (T ),g T ], i(r) i {0,1728}. 
Consider the diagram 

Si(JV)' >Sr(iV)^ c 



Ki(JV)'- -^^(JV)^ 

where the primes mean to remove finitely many points, the map across the 
top was just given, and the map across the bottom comes from the definition 
of the planar model, 



[C/Ar, l/N + A T \ I > [E j[T) ,Q T ] 



A(7V)Th 



^(j(r)Ji(r)). 



The complex algebraic map from the moduli space to the planar model of the 
modular curve comes from completing the diagram, 

[^•(r),Qr]^(j(r),/i(r)). (7.20) 

Obtaining V'l.aig from this uses one last commutative diagram, 

Yl (N)' >X 1 {N)$£g' 



Xt(N) >X 1 (N) aSStC - 

Here the map across the top row is from the previous diagram, the map 
down the left side is inclusion, the map down the right side is the birational 
equivalence from the planar model of the modular curve to the actual modular 
curve as discussed after Theorem 7.2.5, and the map across the bottom is the 
identification of the Riemann surface with the complex points of the algebraic 
modular curve. Since the birational map is undefined at finitely many points 
of the planar model, the primes in the top row now mean to remove finitely 
more points as necessary for all the maps to make sense. Thus V'l.aig on all 
but finitely many points of Si(_/V) a i gj c is described more explicitly than before 
as (7.20) followed by the birational map. Finally, the map from Si(7V)^ lg to 
the planar model can be described without reference to the larger complex 
analytic environment. It is 

+ *i(A0± anar , 



[E,Q]^(j(E),x(Q)), 



(7.21) 



where x(Q) is the x-coordinate of Q and again the prime means to preclude 
j = 0,1728. This follows from (7.20) since /i(r) = x(Q T ). We will use this 
description in Chapter 8. 
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7.9.1. Show that the intersection of an equivalence class in Si(iV) a ig j c with 
Si(iV) a ig is an equivalence class in S\(N) a \ g . 

7.9.2. From Exercise 1.5.4 the matrix [ ^ ~J] normalizes r 0 (N) and so it 
gives an involution 

w N : X Q (N) — ► X Q (N), r Q (N)r ^ r 0 (N)(-l/(Nr)). 

(a) Let t' = —\/{Nt). Explain why showing that is defined over Q 
reduces to showing that j(r') and Jat(t') lie in Q(X 0 (N) a i g ). Show this. (Hints 
for this exercise are at the end of the book.) 

(b) Now we know that the algebraic version of on the planar model 
of X 0 (N) defined by the minimal polynomial p 0 of /o over Q(j) is 

w N : X 0 (N)H*™ X 0 (N)^, (j, x) ^ (j> , x>), 

a map whose first component is a rational function rjy over Q in two variables. 
Describe a map 

SoW alg ^AVA0S nar 

analogous to (7.21). Explain why j(E/C) = r N (j(E), x(C)) where E is an 
elliptic curve over Q, C is an TV-cyclic subgroup of E, and x(C) is the sum of 
the ^-coordinates of its nonzero points. 

(c) Describe wn on the planar model of X a (N) defined by the modular 
equation <P(j, j^) — 0, cf. Section 7.5. What does this show about the poly- 
nomial <P(x, y)l 

7.9.3. This exercise shows that the Hecke operator T p is defined over Q. 

(a) The double coset configuration specialized to T p (cf. Section 5.2) in- 
volves the matrix a = [J °], the group A = A°(iV» = A (AT) n T°(p), and 
the modular curve X®(N,p) = -X"(A°(Af,p)). Similarly define 

r 1>0 (N, P ) = a (AO n r 0 (N P ), x 1>0 (n, p ) = x(r 1>0 (N, P )). 

Show that r lfi (N,p) = a/f (iV.pJa- 1 , i.e., F 3 = r lfi (N,p) and X' 3 = 
Xi^(N,p) in the double coset configuration. Show that the paragraph after 
Theorem 6.3.2 rephrases to say that T p is the pullback of the map 

X lfi (N,p) —tX^N), r h0 (N,p) T ^ r^pr (7.22) 

followed by the pushforward of the map 

X lfi (N,p) -^Xt{N), r lfi (N,p) T ^ A(JV)t. (7.23) 

The goal is to show that each of these maps is defined over Q. 

(b) Show that A(ATp) C r lfi (N,p) C r 0 (Np). Zoom in on Figure 7.2 and 
cite (7.13), both with Np in place of N, to obtain Figure 7.8 here, in which 
the intermediate field K on the right corresponds to C(Xifi(N,p)) on the left. 
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(Z/.\>Z)\/{±lj { C(X lfi (N,p)) 
C(X 0 (Np)) 



Q(X 1 (7Vp) alg ) 



K V (Z/7VpZ)*/{±l} 

Q(X 0 (7Vp) alg ) > 



Figure 7.8. Function fields for T p 



Explain why K is the function field of a curve over Q. Denote this 
curve Xi fi {N,p) a i & , so that Q(Xi ) o(iV,p) a i g ) = K. Since Xi(7Vp) a i giC is 
isomorphic over C to X\{Np), and similarly for Xo, Theorem 7.7.1 also 
gives a diagram as in Figure 7.9. Apply the Restriction Lemma twice to 
this figure to show that C(X li0 (iV,p) algi c) = C(X lj0 (iV,p)) and therefore 
Xi, 0 (-/V,p) a i g; c = X\fl{N,p) up to isomorphism over C. Thus X\fi(N,p) is 
defined over Q. 



CiXtiNp)) t 
C(X h0 (N,p) alg . c ) 
C(X 0 (Np)) 



Q(X 1 (Np) &lg ) 

Q(Xl, 0 (W,p)alg) 

Q(X 0 (iVp) alg ) 

Figure 7.9. Comparing C(Xi, 0 (iV,p) a i g ,c) and C(Xi j0 (iV,p)) 



Obtain the tower of fields and /3-images of subgroups of the Galois 
group Gal(Q(j, Ej[Np])/Q(j)) shown in Figure 7.10. (Hints for this exercise 
are at the end of the book.) 



QiX^Np)^) 
Q(X lfi (N,p) alg ) 
Q(X 0 (Np) als ) 



.{±[g^]eGL 2 (Z/ATpZ)} 

.{± [g»] e GL 2 (Z/NpZ) :d=l (mod iV)} 

.{[8S] eGL 2 (Z/N P Z)} 

Figure 7.10. Galois groups 



(c) The two function field injections corresponding to the maps (7.22) 
and (7.23) are 
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C(AyAO) — ► C(X lfi (N,p)), /(r) i ^ f(pr) 

and 

C(AVAO) — ► C(X li0 (JV,p)), /(r) ^ f(r). 

Since C(X 1 (A r )) = C(j, /i) the nontrivial map has image C(j(pr), /i(pr)). 
To show that the injections restrict to the function fields over Q it thus suffices 
to show 

j(pr) G Q(X lfi (N,p) alg ), h(pr) e Q(X 1;0 (7V,p) alg ). 

Use the group containment A,o(^p) C -To(p) to show the first of these. 

(d) Let t' = and let f = j(r'), both in Q(^i,o(-^>p)aig) an d thus 
in Q(j, Ej[Np\). Show that there is a morphism : £7j — > Ey with kernel 
(Q t .n), taking Q T ,Np to Q T ',n, and defined over some Galois extension L 
of Q(j, £j ;[Np]). Here the second subscript of each Q denotes its order. 

(e) Let K = Gal{q,{j,E j [Np})/q,{X lfi {N,p)^)). Any a e K extends 
to Gal(L/Q(Xi ; o(A r ,p) a i g )). Show that then ip a — ±<p and <p(Q T ,Np) — Qt',n- 
Deduce that f\{pr) £ Q(Xi i0 (N,p) a \ g ) as desired. 

7.9.4. (a) Let h : X — > Y be a surjective morphism over Q of algebraic curves 
over Q. Thus h extends to a surjective morphism h : X c — ► Y c where Ac 
and Yc are the complex points of X and Y. Show that if y € Y C Yc then 
all of its inverse image points x € Ac lie in X, as follows. Compute that for 
every automorphism a of C that fixes Q, also x a lies in the inverse image of y. 
Therefore x has only finitely many conjugates over Q, meaning its coordinates 
are algebraic over Q and thus elements of Q. How does this show that the 
point P from the section lies in A 1 (A f ) alg ? 

(b) Let E be an algebraic curve over Q. Apply part (a) to show that 
E[N] = (Z/NZ) 2 without quoting the structure result in Theorem 7.1.3. (A 
hint for this exercise is at the end of the book.) 
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The Eichler— Shimura Relation and L-functions 



This chapter arrives at the first version of the Modularity Theorem stated 
in the preface to the book: For any elliptic curve E over Q there exists a 
newform / such that the Fourier coefficients a p (f) are equal to the solution- 
counts a,p(E) of a Weierstrass equation for E modulo p. Gathering the a p (f) 
and a p (E) into L- functions, this rephrases as 

L(s,f)=L(s,E). 

The techniques that relate this to other versions of Modularity involve working 
modulo p and expressing both a p (E) and a p (f) in terms of the Frobenius map, 

The key is the Eichler-Shimura relation, expressing the Hecke operator T p in 
characteristic p in terms of a p . Since reducing algebraic curves and maps from 
characteristic 0 to characteristic p is technical, the chapter necessarily quotes 
many results in quickly sketching the relevant background. The focus here is 
on the Eichler-Shimura relation itself in Section 8.7 and on its connection to 
Modularity in Section 8.8. 

Chapter 7 transferred the Modularity Theorem from analysis to algebraic 
geometry and changed its underlying field of definition from C to Q. Now 
that the objects are algebraic we drop "alg" from the notation, so 

• Xi(N) denotes the modular curve as a nonsingular algebraic curve over Q, 
with function field Q(Xi(AT)) = Q(j, fr). 

• S\(N) denotes the moduli space consisting of equivalence classes [E,Q] 
where E is an algebraic elliptic curve over Q, Q e E is a point of order N, 
and the equivalence relation is isomorphism over Q. 

Also, the Jacobian of a compact Ricmann surface X is identified with the 
Picard group of X, both denoted Pic°(X). The notation Jac(X) will no longer 
be used. 

Related reading: Chapter 7 of [Shi73], Rohrlich's article in [CSS97], [Sil86], 
[Kna93]. 
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8.1 Elliptic curves in arbitrary characteristic 

Much of the material from Section 7.1 on elliptic curves in characteristic 0 
is also valid in characteristic p. Let k be an arbitrary field. A Weierstrass 
equation over k is any cubic equation of the form 

E : y 2 + aixy + a 3 y = x 3 + a 2 x 2 + a 4 x + a e , di,...,a 6 ek. (8.1) 

To study this, define 

b 2 = a\+Aa 2 , b 4 = a 1 a 3 + 2a 4 , 6 6 = a| + 4a 6 , 
b$ = a\a§ — 010304 + 0203 + 4o206 — a 4 , 

and define the discriminant of the equation to be 

A = -0% - 8b 3 - 21b\ + 9b 2 b 4 b e . 

Further define 

c 4 = b\-2Ab 4 , c 6 = -b\ + 366 2 &4 - 216& 6 , 
and if A ^ 0 define the invariant of the equation to be 

3 = 4/ A. 

Then all bi G k, A G k, all a € k, and j £ k when it is defined. Also, 4& 8 = 
b 2 b 6 — b\ and 1728Z\ = c\ — c\. (Confirming the calculations in this paragraph 
is Exercise 8.1.1(a).) If k does not have characteristic 2 then replacing y 
by y — (a\x + 0,3) / 2 in (8.1) eliminates the xy and y terms from the left side, 
reducing the Weierstrass equation to 

E :y 2 = x 3 + (b 2 x 2 + 26 4 a; + 6 6 )/4, b 2 , b 4 , h e k, char(k) ^ 2. (8.2) 

If k does not have characteristic 2 or 3 then replacing x by (x — 362)/36 and y 
by y/216 in (8.2) eliminates the x 2 term from the right side, further reducing 
the Weierstrass equation to the form 

E :y 2 = x 3 - 27c 4 x - 54c 6 , c 4 , c 6 £ k, char(k) £ {2, 3}. (8.3) 

Since (8.2) and (8.3) are special cases of (8.1), the coefficients of a Weierstrass 
equation will be referred to as the a* in all cases. 
As before, 

Definition 8.1.1. Let k be an algebraic closure of the field k. When a Weier- 
strass equation E has nonzero discriminant A it is called nonsingular and 
the set 

2 

E = {(x, y) £ k satisfying E(x, y)} U {00} 
is called an elliptic curve over k. 
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Note that an elliptic curve over k has infinitely many points even when k 
is a finite field. 

The general admissible change of variable is 

x = u 2 x' + r, y = u 3 y' + su 2 x' + t, ii,r,s,iek, 0. 

These form a group, and they transform Weierstrass equations to Weierstrass 
equations, taking the discriminant A to A/u 12 and preserving the invari- 
ant j (Exercise 8.1.1(b)). In particular the changes of variable between Weier- 
strass equations (8.1), (8.2), and (8.3) are admissible, and thus we may work 
with (8.3) if char(k) <£ {2, 3} and with (8.2) if char(k) ^ 2, needing the gen- 
eral (8.1) only when char(k) = 2 or when there is no assumption about the 
characteristic — for example, Section 8.3 will discuss reducing a Weierstrass 
equation (8.1) over Q modulo an arbitrary prime p. Since the changes of vari- 
able from (8.1) to (8.2) to (8.3) have u = 1 and u — 1/6 respectively, the 
discriminant of (8.3) is 6 12 times the discriminant of (8.1) and (8.2), i.e., 
A (8 . 3) =2^(4-c 2 ). 

Somewhat awkwardly, the cubic equation y 2 = Ax 3 — g 2 x — g 3 from Chap- 
ter 7 is no longer a Weierstrass equation by our new definition since the 
coefficients of y 2 and x 3 are unequal, and replacing y by 2y to put it in the 
form (8.3) is an inadmissible change of variable. Both sorts of cubic equation 
can be encompassed in more general definitions of Weierstrass equation and 
admissible change of variable, but since normalizing the coefficients of y 2 and 
x 3 to 1 simplifies the formulas of this chapter we accept the small inconsistency 
in terminology instead. Modulo the inadmissible substitution, the earlier defi- 
nitions of the discriminant and the invariant for equations y 2 = Ax 3 — g 2 x — g 3 
are the same as their definitions for (8.3) (Exercise 8.1.1(d)). 

In particular, replacing y by 2y in the universal elliptic curve (7.12) leads 
to the Weierstrass equation 

with discriminant 2 6 3 12 j 2 /(j — 1728) 3 and invariant j. An admissible change 
of variable then gives a more general universal curve (Exercise 8.1.1(e)) 

f + *V = * 3 - (j^rW)^ (j3l72s)' (8 ' 5) 

with discriminant j 2 /(j — 1728) 3 and invariant j. The curve (8.5) is well suited 
for fields of arbitrary characteristic since its discriminant is nonzero even in 
characteristic 2 or 3. Section 8.6 will use it to define modular curves in prime 
characteristic as Chapter 7 used the universal elliptic curve to define modular 
curves over Q. 

Most of the results from Section 7.1 hold in arbitrary characteristic. The 
Weierstrass polynomial associated to (8.1) is 
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E{x,y) = y + a\xy + a 3 y — x — a 2 x — a 4 .x — a 6 G k[x, y], (8.6) 

and similarly for (8.2) and (8.3). In all cases the Weierstrass equation is non- 
singular if and only if the corresponding curve E is geometrically nonsingular, 
i.e., the gradient of the Weierstrass polynomial never vanishes. Verifying this 
in characteristic 2 requires a different argument from the one given before 
(Exercise 8.1.2). Again an elliptic curve E lies in P 2 (k), it forms an Abclian 
group with the infinite point [0,1,0] (see Exercise 8.1.3(a)) as its additive 
identity 0#, and the addition law is that collinear triples sum to 0e- Opposite 
pairs of points P and — P have the same x-coordinate, at most two points 
with the same x-coordinate satisfy (8.1), and so any two points with the same 
x-coordinate are equal or opposite, possibly both. Since the y-values satisfy- 
ing (8.1) for a given x sum to — a\x — a 3 the additive inverse of P = (xp,yp) 
is the natural companion point 

-P = (x P , -y P - aix P - a 3 ). 

Given points P and Q of E, let R be their third collinear point. The addition 
law P + Q = —R says that the sum is the companion point of R, 

if P, Q, R are collinear then P + Q = {x R , —y R — a x x R — a 3 ). (8.7) 

Let k pr i mo denote the prime subfield of k, meaning the smallest subficld in- 
side k, either the rational numbers Q if char(k) = 0 or the finite field F p of 
order p if char(k) = p. Then the group law is defined by rational functions r 
and s over the field k pr i me ({aj}) where the ctj are the Weierstrass coefficients. 
If xq — xp and yq = —yp — a\Xp — a 3 then Q — —P and so P + Q = 0e- 
Otherwise P + Q lies in the affine part of E. Let 



and 



vq - yp 

x Q - x P 

3xp + 2a 2 xp + a 4 — a\yp 
a\x P + a 3 + 2y P 



ypXQ - y Q x P 

x Q - x P 
—xp + a^xp + 2a 6 - a 3 y P 
aixp + a 3 + 2y P 



xp ^ x Q , 
x P = x Q , 

xp ^ x Q , 
x P = XQ. 



The line y — Xx + \x passes through P and Q when P ^ Q and is the tangent 
line to E at P when P = Q (Exercise 8.1.4(a)). The rational functions giving 
x P+Q and y P+ Q are 

r{xp,x Q ,yp,y Q ) = A 2 + a 1 \- a 2 - x P - xq, 
s(xp,x Q ,yp,y Q ) = -(\ + a 1 )r(xp,XQ,yp,y Q ) - fj, - a 3 . 
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This algebraic definition of addition corresponds to the geometric descrip- 
tion (8.7) (Exercise 8.1.4(b)). As before, the casewise expressions for A and /i 
arise from a single rational function (Exercise 8.1.4(c)). For any algebraic 
extension K/k the set of K-points of E is a subgroup of E, 

E(K) = {PeE- {0 e } : (x P ,y P ) e K 2 } U {0 E }. 

Let N be a positive integer. The structure theorem for the A^-torsion 
subgroup E[N] = ker([7V]) of an elliptic curve is 

Theorem 8.1.2. Let E be an elliptic curve over k and let N be a positive 
integer. Then 

E[N] = Y[ E[p e "] where N = JJ p e " . 

Also, 

E[p e ] S (Z/p e Z) 2 ifp^ char(k). 
Thus E[N] (Z/NZ) 2 i/char(k) \ N. On the other hand, 



E[p e ] S Z/p e Z for o//e>l' 



or 



E[p e ] = {0} /or all e > 1 



i/p = char(k). 



In particular, if char(k) = p then cither E[p] = Z/pZ, in which case E is 
called ordinary, or E[p] = {0} and E is super singular. 

In this chapter we will also need to know a bit about singular Wcicrstrass 
equations. As already mentioned, the condition A = 0 is equivalent to the con- 
dition that for some point P satisfying the Weierstrass polynomial (8.6), both 
partial derivatives vanish at P. The projective point [0, 1,0] is always nonsin- 
gular (this was Exercise 8.1.3(b)), so any such P is affinc. The coordinates 
of P lie in k (Exercise 8.1.5(a)), so an admissible change of variable over k 
takes P to (0,0). Then the conditions £(0,0) = L>i£(0,0) = D 2 E(0,0) = 0 
force the Weierstrass polynomial to be 

E{x,y) = y 2 + a x xy - x 3 - a 2 x 2 . (8.9) 

If char(k) ^ 2 then letting y = y + aix/2 in (8.9) simplifies this to 

E(x,y) = y 2 -x 3 - a' 2 x 2 . (8.10) 

The point P = (0, 0) is the only singular point satisfying E. If char(k) = 2 
then this is easy to verify from (8.9), while if char(k) ^ 2 then it follows 
from (8.10) (Exercise 8.1.5(b)). 
Rewrite (8.9) as 



E(x, y) = (y- m\x)(y - m 2 x) - x 3 . 



(8.11) 



314 



8 The Eichler-Shimura Relation and L-functions 



The singular point P is called a node if mi 7^ m 2 , meaning that two distinct 
tangent lines pass through the curve at P. and it is called a cusp if mi = m 2 : 
when there is only one tangent line. (See Figure 8.1.) Working from (8.9) 
and (8.11), it is easy to compute that C4 = (mi — m 2 ) 4 (Exercise 8.1.5(c)), 
so that the curve has a node if c 4 7^ 0 and a cusp if C4 = 0. These conditions 
apply to the Weierstrass equation in its original form since the admissible 
change of variable translating the singular point P to (0, 0) multiplies c 4 by 
a nonzero scalar, cf. Exercise 8.1.1(c). In sum, 

Proposition 8.1.3. Let E be a Weierstrass equation overk. Then 

• E describes an elliptic curve A 7^ 0, 

• E describes a curve with a node A = 0 and c 4 7^ 0, 

• E describes a curve with a cusp -<==^> A — 0 and c 4 = 0. 

In the case of a node the set of projective solutions of E other than the singular 
point forms a multiplicative group isomorphic to k , and in the case of a cusp 
the set forms an additive group isomorphic to k. (See [Sil86] for the proof 
of this, and also see Exercise 9.4.2.) The notions of ordinary, supersingular, 
multiplicative, and additive Weierstrass equations in characteristic p will all 
figure in this chapter. 



Figure 8.1. Node and cusp 



Exercises 

8.1.1. (Suggestion: Don't do this problem by hand.) 

(a) Confirm that 46 8 = b 2 b 6 - b\ and that 1728Z\ = c\ — c%. Confirm that if 
char(k) 7^ 2 then replacing y by y — {a\x + a%) /2 in (8.1) gives (8.2). Confirm 
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that if char(k) £ {2,3} then replacing (x,y) by ((x - 36 2 )/36, y/216) in (8.2) 
gives (8.3). 

(b) Show that the admissible changes of variable x = u 2 x' + r, y = 
u 3 y' + su 2 x' + t where u,r,s,t £ k and u ^ 0 form a group. Show that 
every admissible change of variable transforms a Weierstrass equation E of 
the form (8.1) into another such equation E' with 

ua[ = ai + 2s, 
u 2 a' 2 = a 2 — sai + 3r — s 2 , 
u 3 a 3 = a 3 + ra\ + 2i, 

u A a! 4 = 04 — SC13 + 2ra2 — (rs + f)ai + 3r 2 — 2st, 
u 6 a' 6 = a 6 + ra 4 — ta 3 + r 2 a 2 — rta\ + r 3 — t 2 1 

and 

M 2 fe2 = b 2 + 12r, 

u 4 ^ = 6 4 + r6 2 + 6r 2 , 

u 6 ^ = 6 6 + 2r6 4 + r 2 6 2 + 4r 3 , 

u s b' s = b s + 3rb 6 + 3r 2 fe 4 + r 3 b 2 + 3r 4 , 

and 

w 4 c 4 = c 4 , u 6 Cg = c 6 , 

and 

u 12 zi'^A / = j. 

(Hints for this exercise are at the end of the book.) 

(c) Suppose the seemingly more general change of variable x — vx' + r, 
y = wy'+vsx'+t, where v, «i,r,s,(€k and v, w are nonzero, takes Weierstrass 
equations to Weierstrass equations. Show that v = u 2 and w = u 3 for some 

nek. 

(d) Replace y by y/2 in the third Weierstrass equation (8.3) to get 

y 2 = 4x 3 - g 2 x - g 3 , g 2 = 108c 4 , g 3 = 216c 6 . 

Show that the previously defined discriminant Z\ 0 m = g 2 — 27gf| of this equa- 
tion is equal to the discriminant Z\(s.3) = 2 6 3 9 (c 4 — Cg) of (8.3) in this section. 
Show that the previously defined invariant j — 1728^2 /^oid is equal to the 
invariant j = c 3 /A in this section. 

(e) Find an admissible change of variable taking the modified universal 
elliptic curve (8.4) to the more general universal elliptic curve (8.5). Confirm 
that the discriminants and invariants of the two universal curves are as stated. 



8.1.2. Show that algebraic and geometric nonsingularity are equivalent in 
characteristic 2. 
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8.1.3. (a) Homogenize the general Weierstrass polynomial by adding in pow- 
ers of z to make each term cubic, 

-Bhom(a;, V, z) = y 2 z + a\xyz + a 3 yz 2 — x 3 — a 2 x 2 z — a^xz 2 — a e z 3 . 

Show that [0, 1, 0] satisfies -Ehom and no other [x, y, 0] does, 
(b) Dehomogenize i^hom by setting y = 1 to obtain 

E(x, z) — z + a\xz + a 3 z 2 — x 3 — a 2 x 2 z — a^xz 2 — a e z 3 . 

In the (x, z) coordinate system, the infinite point Ob is (0, 0). Working in this 
affinc coordinate system, show that E is geometrically nonsingular at 0^. 

8.1.4. (a) Show that the casewise definitions of A and fi make the line y = 
\x + n the secant line through P and Q when P ^ Q and the tangent line 
to E through P when P = Q. 

(b) Show that the geometric and algebraic descriptions (8.7) and (8.8) of 
the group law agree. 

(c) Multiply the numerator and the denominator of the secant case A by 
Vq + Vp + a-ixp + &3 and use the Weierstrass equation (8.1) to obtain a new 
expression for A when yQ + yp + a\Xp + a% ^ 0. Show that this also agrees 
with the old A when xp = xq, suitably giving oo when P = —Q. Similarly 
derive a new expression for fi. 

8.1.5. (a) Let P be a singular point of a Weierstrass equation E over k. Show 
that the coordinates of P lie in k. (A hint for this exercise is at the end of the 
book.) 

(b) Show that if char(k) = 2 then (0, 0) is the only singular point satisfying 
the Weierstrass polynomial (8.9), and if char(k) ^ 2 then (0,0) is the only 
singular point satisfying the Weierstrass polynomial (8.10). 

(c) Show that C4 = (mi — rri2) A in the context of equations (8.9) and (8.11). 

8.1.6. (a) For what values a, b e Q do the Weierstrass equations 

y 2 = x 3 + ax 2 + bx, y 2 = x 3 - 2ax 2 + (a 2 - 4b)x 

both define elliptic curves E and E' over Q? 

(b) For such values a and b show that the map 

(x,y) 1 y (y 2 /x 2 ,y(b-x 2 )/x 2 ) 

defines a map ip : E — ► E' taking Ob to 0e i ■ 

(c) Compute ker(^>) and compute y _1 (0,0). 

(d) Find the dual isogeny -0 : E' — > E, verifying the compositions ipoip = 
[2] E and ipoi/j= [2] E >. 
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8.2 Algebraic curves in arbitrary characteristic 

This section discusses algebraic curves over an arbitrary field k. The definition 
is as in Section 7.2. Given polynomials ipi, . . . , ip m € k[xi, . . . , x n ] such that 
the ideal 

I = {ip 1 ,...,<p m ) Ck[xi,...,x„] 

is prime, let 

C = {P e k" : ip(P) = 0 for all tp g /}. 

The function field k(C) of C over k is the quotient field of the coordinate 
ring k[C] = k[xi, . . . , x n ]/I. If k(C) is a finite extension of a field k(t) where 
t is transcendental over k then C is an affinc algebraic curve over k. If for 
each point P g C the derivative matrix [Dj^P)] has rank n — 1 then C is 
nonsingular. Especially if C is defined by one equation in two variables then 
it is nonsingular if the gradient never vanishes. The projective version Chom 
of the curve is defined as before. A nonempty algebraic curve over k contains 
infinitely many points even when k is finite. 

The Curves-Fields Correspondence is also unchanged. The map 

C i y k(C) 

induces a bijection from the set of isomorphism classes over k of nonsingular 
projective algebraic curves over k to the set of conjugacy classes over k of 
function fields over k. And for any two nonsingular projective algebraic curves 
C and C over k, the map 

(h : C — > C) ^ (h* : k(C") — > k(C)) 

is a bijection from the set of surjective morphisms over k from C to C to 
the set of k-injections of k(C") in k(C). Again nonconstant morphisms are 
surjective. 

Let p be prime. Along with the parallels to Section 7.2 just described, 
algebraic curves and their function fields exhibit new phenomena in charac- 
teristic p. Let F p denote the field of p elements and let F p denote its algebraic 
closure. For any positive integer power q = p e of p there is a unique field F q 
of order q in F p , the splitting field of the polynomial x q — x over F p (Exer- 
cise 8.2.1). The algebraic closure is the union of all such F q . This is because 
every clement of F p lies in a finite extension field K of F p , a finite-dimensional 
vector space over F p that therefore has order q = p e for some e, and thus 
K = F q by uniqueness. 

The Frobenius map on F. p is 

ct<p • ' y Fp . x i y x 

Because [x + x) p — x p + x p in characteristic p, the Frobenius map is a field 
automorphism. Its inverse is thus again an automorphism of F p but is not a 
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polynomial function. Its fixed points are F p , the roots in F p of the polynomial 
x p = x, and more generally the fixed points of a p (e-fold composition where 
e > 1) are F q where q — p e . For each such q the field extension F q /F p is Galois 
and its Galois group is cyclic of order e, generated by a p , but the group of 
automorphisms of F p is not cyclic. 
The Frobenius map on F™ is 

Cp : Fp > Fp , (xi , ■ • ■ , Xn) i— > (x^ , . . . , x p n ) . 

This is a bijection. Its fixed points are F™. It induces a well defined bijection 
at the level of projective classes, 

u p : P"(F P ) — ► P"(F P ), [x 0 , xi, . . . , x n \ ^ [x p 0 , x\, . . . , <], 

with fixed points P™(F p ). 

Let (p(x) = ~^2 e a e x e G F p [xo,Xi, . . . ,x n ] be a homogeneous polynomial 
(where x e is shorthand for ), and let Lp ap be the polynomial obtained 

by applying the Frobenius map to its coefficients, ip ap (x) — ^ e aZ v x e . Then 
(Exercise 8.2.2) 

l p"p{x"p) = l p{xYp. (8.12) 

This shows that if C is a projective curve defined over F p by polynomials ipi 
and if C ap is the corresponding curve defined by the polynomials tp°" then a p 
restricts to a morphism from C to C a " . That is, if P £ P™(F p ) and ¥>i(P) = 0 
for all i then <p° F (P ap ) = 0 for all i as well. Especially, if C is defined over F p 
then C ap = C and a p defines a morphism from C to itself. Summarizing, 

Definition 8.2.1. Let C be a projective curve over F p . The Frobenius map 

on C is 

o-p-.C — > C 7 ", [x 0 ,xi, ...,x n ] i-> [»o,a^, . . .,x p n ]. 

For example, the Frobenius map on P 1 (F p ) is cr p (t) = t p on the affine part, 
and so the induced map a* gives the extension of function fields K/k where 

K = F p (i), k = F p (s), s = t p . 

Thus K = k(f). The minimal polynomial of t over k is x p — s, so the function 
field extension degree is p even though the Frobenius map bijects and we 
therefore might expect its degree to be 1. Since x p — s = (x — t) p over K, the 
extension is generated by a pth root that repeats p times as the root of its 
minimal polynomial. 

Similarly the Frobenius map on an elliptic curve over F p is a p (u, v) = 
(u p ,v p ) on the affine part, so that a* gives the extension of function fields 
K/k where 



K = F p (u)[v]/(E(u,v)), k = F p (s)[t]/(E(s,t)), s = u p ,t = v p . 
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Thus K = k(u, v). The minimal polynomial of u in k[x] is x p — s, and this 
factors as (x — u) p over k(u). The minimal polynomial of v in k(u)[y] divides 
E(u,y), a quadratic polynomial in y. So 

[k(u):k]=p and [k(u, : k(u)] e {1, 2}. 

A similar argument shows that 

[k(» : k] = p and [k(u, w) : k(v)] e {1, 3}. 

Therefore K = k(u) = k(u) and [K : k] = p. Again the function field exten- 
sion degree is p even though the Frobenius map is a bijection, and again the 
extension is generated by a pth root that repeats p times as the root of its 
minimal polynomial. 

Definition 8.2.2. An algebraic extension of fields K/k is separable if for 

every element uofK. the minimal polynomial of u in k[x] has distinct roots 
in k. Otherwise the extension is inseparable. A field extension obtained by 
adjoining a succession of pth roots that repeat p times as the root of their 
minimal polynomials is called purely inseparable. 

Since a polynomial has a multiple root if and only if it shares a root with its 
derivative, every algebraic extension of fields of characteristic 0 is separable. 
Every algebraic extension K/k where k is a finite field is separable as well. 
The examples given before Definition 8.2.2 are purely inseparable. 

Let h : C — 5- C be a surjective morphism over F p of nonsingular pro- 
jective curves over F p . Let k = F p (C) and K = F p (C), so that the induced 
Fp-injection of function fields is h* : k — ► K. The field extension K//i*(k) 
takes the form 

ft*(k) C k sep C K 

where k sep //i*(k) is the maximal separable subextension of K//i*(k) (this 
exists since the composite of separable extensions is again separable) and 
thus K/k scp is purely inseparable. Factoring h* : k — > K as 

K ep hT 
k >k sep 

gives a corresponding factorization of h : C — > C , 

c ^C sop 

where the first map is h- lna — cr p with p e = [K : k sep ]. That is, 

h = h scp o o~p . 

The morphism h is called separable, inseparable, or purely inseparable accord- 
ing to whether the field extension K//i*(k) is separable (i.e., e = 0), insepa- 
rable (e > 0), or purely inseparable (h sep = 1). As in Chapter 7, the degree 
of h is 
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deg(h) = [K : h*(k)}. 
The separable and inseparable degrees of h are 

deg sop (/i) = degOscp) = [k scp : ft*(k)], 
deg ins (/i) = deg(/i ins ) = [K : k sop ], 

so that 

deg(h) = deg scp (/i) deg ins (/i). 
In characteristic p the degree formula from Chapter 7 remains 

e P (/i) = deg(/i) for any Q e C". 

P£h~i{Q) 

In particular, 

X] ep(h sep ) = deg scp (/i) for any Q <= C", 

P€h 8 ~ep(<3) 

and the ramification index ep(h scp ) <E Z + is 1 at all but finitely many points. 
Since /i; ns bijects it follows that deg sop (h) = \h~J^(Q)\ = \h~ 1 (Q)\ for all but 
finitely many Q £ C . This description applies even when h is not surjective 
(so that h maps to a single point and has degree 0), and this description 
shows that separable degree is multiplicative, i.e., if h! : C — > C" is another 
morphism then deg sep (h'oh) — deg sep (/i') deg sep (h). Consequently inseparable 
degree is multiplicative as well since total degree clearly is. 

We have seen that the Frobenius map a p is purely inseparable of degree p 
on P 1 (F p ) and on elliptic curves E over F p , and in fact this holds on all curves. 
On elliptic curves, where there is a group law, the map a v — 1 makes sense, 
and it is separable: for otherwise a v — 1 = / o a p where / : E — > £ is a 
morphism, so 1 — g o o v where .9=1—/, giving a contradiction because a p 
is not an isomorphism. 

The map [p] is an isogeny of degree p 2 on elliptic curves over F p . (This 
statement will be justified later by Theorem 8.5.10.) As a special case of the 
degree formula, if ip : E — 5- E' is any isogeny of elliptic curves then 

deg sep (^) = |ker(^)|. (8.13) 

By this formula and by the structure of ker([p]) = E[p] as {0,e} or Z/pZ, 
deg sep ([p]) is 1 or p. Thus [p] is not separable, and so it takes the form [p] = 
f o <7p for some rational /. This shows that although the inverse a p l of the 
Frobenius map is not rational, the dual isogeny a p = f is. If E[p] = {0e} 
then [p] is purely inseparable of degree p 2 and so a p — i o a p where i is 
an automorphism of E, while if E[p] = Z/pZ then [p] has separable and 
inseparable degrees p and so a p is separable of degree p. 
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This section ends by deriving commutativity properties of the induced 
forward and reverse maps of the Frobenius map. Let C be a projective curve 
over F p . The forward induced map of o p on C acts on divisors in the usual 
way as 

a p ,* : (P) i ^ (<t p (P)). (8.14) 

Since a p is bijective and is ramified everywhere with ramification degree p, 
the reverse induced map acts on divisors as 

a;:(P)^p(a-\P)). (8.15) 

Let h : C — > C be a map over F p of projective curves over F p . Then the 
Frobenius map commutes with h. That is, if cr Pi c denotes the Frobenius map 
on C and similarly for C then (8.12) shows that 

ho <j p C — <j p C , o h. (8.16) 

It follows that the forward induced map of the Frobenius map commutes with 
the forward induced map of h, 

h* ° {v P ,c)* = {c P ,C>)* ° h *- ( 8 - 17 ) 
Since the Frobenius map commutes with h as in (8.16) so does its inverse, for 

h ° a p,c = n ph' ° a p> c ' oho a P,c 

= a p,c> oho a vfi o <y p }j = a p.c" ° h - 

Now compute for any P e C, 

(h*°v* p ,c)(P) =P(hoa- 1 c )(P) by (8.14) and (8.15) 

= p(a- 1 c ,oh)(P) by (8.18) 

= «,c ° by ( 8 - 14 ) and ( 8 - 15 ) a g am - 

Thus the reverse induced map of the Frobenius map also commutes with the 
forward induced map of h, 

t l *°<7p < c = < T p,C' 0 I 1 *- (8.19) 

We will use formulas (8.17) and (8.19) in Section 8.7 and in Section 8.8. 

The only place in Section 7.2 that uses characteristic 0 is the argument that 
a curve has a planar model. This also holds in characteristic p when the under- 
lying field k is perfect, meaning every element has a pth root in the field. For 
example, finite fields and their algebraic extensions are perfect. Exercise 8.2.3 
sketches the argument for readers equipped with the armamentarium of field 
theory in characteristic p. 



(8.18) 
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Exercises 

8.2.1. Let q = p e . Show that the solutions of x q — x in F p form a subficld, 
and that this is the unique subfield of q elements. (A hint for this exercise is 
at the end of the book.) 

8.2.2. Verify (8.12). 

8.2.3. Let L/k(t) be a finite extension where t is a variable and k is perfect. 
This exercise sketches the argument that the extension is primitive. Supply 
details as necessary. The Steinitz Criterion states that an extension is primitive 
if and only if there are only finitely many intermediate fields. Thus to prove 
that L/k(t) is primitive we may replace L by its normal closure over k(t). So 
assume that the finite extension L/k(£) is also normal. 

The extension decomposes as k(£) C Ki C L, where Ki = k(i) scp so that 
the first extension is the maximal separable subextension of L/k(i) while the 
second is purely inseparable. But also, let K 2 be the subfield of L fixed by the 
automorphism group Aut(L/k(i)). Then k(£) C K 2 C L, and because L/k(i) 
is normal the first extension is the maximal purely inseparable subextension 
of L/k(i) while the second is separable. These two decompositions of the 
extension L/k(t) show that L = KiK 2 . 

Since the extension Ki/k(t) is separable, it has a primitive element, i.e., 
Ki = k(£)(a). Since k is perfect, K 2 takes the form k(t)(b) where b q = t, 
q = p e for some e. Thus L = k(£)(a, b), where a is separable over k(f). Since 
a + b has the same number of conjugates under Aut(L/k(£)) as a does, and 
this is the separable degree of k(t)(a + b)/k(t), it follows that k(t)(a + b) 
contains k(i)(a). Consequently k(f)(a + b) = k(i)(a, b) = L. That is, the 
extension L/k(i) is primitive. 

8.3 Elliptic curves over Q and their reductions 

This section discusses reducing elliptic curves defined over Q modulo a 
prime p. Consider a general Weierstrass equation E defined over Q, 

E : y 2 + aixy + a 3 y = x 3 + a 2 x 2 + a 4 x + a 6 , a\, . . . , a& G Q, 

and consider admissible changes of variable over Q. In particular the change of 
variable (x, y) = (u 2 x' , u 3 y') gives a Weierstrass equation E' with coefficients 
a! { = ai/u l . A suitable choice of u makes this an integral equation, so we 
now assume that the original coefficients Oj are integral. View two integral 
Weierstrass equations as equivalent if they are related by an admissible change 
of variable over Q. 

For any prime p and any nonzero rational number r let v p {r) denote the 
power of p appearing in r, i.e., v p {p e ■ m/n) = e e Z where p \ ran. Also define 
f p (0) = +oo. This is the p-adic valuation, analogous to the valuations from 
Section 7.2 in that for all r,r' G Q, 



8.3 Elliptic curves over Q and their reductions 323 

v p (rr') = v p {r) + v p {r'), 
v v (r + r') > mm{v p (r), v p (r')} with equality if v p {r) ^ v p {r'), 

but occurring now in the context of number fields rather than function fields. 
For each prime p let v p (E) denote the smallest power of p appearing in the dis- 
criminant of any integral Weierstrass equation equivalent to E, the minimum 
of a set of nonnegative integers, 

v p {E) — mva.{v p (A(E')) : E' integral, equivalent to E}. 

According to Exercise 8.1.1(b), admissible changes of variable lead to c' 4 = 
Czt/u 4 , Cg = c 6 /u 6 , and A' = A/u 12 . Thus if v p (ci) < 4 or v p {cq) < 6 or 
v p (A) < 12 then v p (E) = v p (A). The converse holds as well for p > 3 (Exer- 
cise 8.3.1). Define the global minimal discriminant of E to be 

A min (E) = l[p^ E \ 

p 

This is a finite product since v p (E) = 0 for all p \ A(E). A little work (Ex- 
ercise 8.3.2) shows that the p-adic valuation of the discriminant can be mini- 
mized to v p (E) simultaneously for all p under admissible change of variable. 
That is, E is isomorphic over Q to an integral model E' with discriminant 
A(E') = A m i n (E). This is the global minimal Weierstrass equation E' , the 
model of E to reduce modulo primes. From now on we freely assume when 
convenient that elliptic curves over Q are given in this form. 

The field F p of p elements can be viewed as Z/pZ. That is, F p is the image 
of a surjective homomorphism from the ring of rational integers Z, reduction 
modulo pZ, 

:Z — ►Fp, h = n + pZ. (8.20) 

This map^educes a global minimal Weierstrass equation E to a Weierstrass 
equation E over F p , and this defines an elliptic curve over F p if and only if 
p \ A m - m (E). The reduction of E modulo p (also called the reduction of E 
at p) is 

1. good [nonsingular, stable] if E is again an elliptic curve, 

a) ordinary if E[p) =S Z/pZ, 

b) supersingular if E[p] — {0}, 

2. bad [singular] if E is not an elliptic curve, in which case it has only one 
singular point, 

a) multiplicative [semistable] if E has a node, 

b) additive [unstable] if E has a cusp. 

These reduction types are all independent of how E is put into global mini- 
mal form. This is because if E has two global minimal Weierstrass equations 
then the admissible change of variable that takes one to the other has co- 
efficients r,s,t,u e Z with u = ±1 (Exercise 8.3.3), preserving A and C4. 
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Good reduction at p means i' p {A m i n (E)) = 0, and when this holds the two 
reduced Weierstrass equations differ by the reduced admissible change of vari- 
able, making them isomorphic elliptic curves with the same p-torsion and thus 
preserving ordinary or supersingular reduction as well. Bad reduction occurs 
when v p {A m \ n {E)) > 0, and either v v (c^) — 0 for multiplicative reduction or 
v p (ci) > 0 for additive reduction, cf. Proposition 8.1.3. The reason for the 
terms "multiplicative" and "additive" was also explained at the end of Sec- 
tion 8.1: in these cases the nonsingular points of E form a multiplicative group 
isomorphic to F p or an additive group isomorphic to F p . "Semistable" is also 
used to mean good or multiplicative but not additive, i.e., not unstable. This 
will be explained in the next section. 

Recall from Section 7.7 that the analytic conductor of an elliptic curve E 
over Q is the smallest integer TV such that X a (N) maps to E per the Modular- 
ity Theorem, i.e., the map is a surjective morphism over Q of curves over Q. 
Putting E into global minimal form provides an almost complete description 
of a related integer Ne called the algebraic conductor of E. The global minimal 
discriminant and the algebraic conductor are divisible by the same primes, 



so that E has good reduction at all primes p not dividing Ne- More specifically, 
the algebraic conductor takes the form Ne — Yl„P fp where 



Here 82 < 6 and S3 < 3. An algorithm for the algebraic conductor due to Tate 
and a formula for the algebraic conductor due to Ogg and Saito are presented 
in [Sil94]. The algebraic conductor, like the analytic conductor, is well defined 
on isogeny classes of elliptic curves over Q. 

Although we have reduced a Weierstrass equation E over Q to E over F p , 
we have not yet discussed reducing the points of the curve E to points of the 
curve E. This will be done in the next section, after we extend the reduction 
map Z — > F p suitably. 

The preface to this book described the Modularity Theorem in terms of 
values a p (E), defined as p minus the solution-count of a Weierstrass equation 
modulo p. Equivalently, 

Definition 8.3.1. Let E be an elliptic curve over Q. Assume E is in reduced 
form. Let p be a prime and let E be the reduction of E modulo p. Then 



p I A min (E) p I N E , 



fo 



if E has good reduction at p, 

if E has multiplicative reduction at p, 

if E has additive reduction at p and p ^ {2,3}, 

if E has additive reduction at p and p £ {2,3}. 




[2 + 6, 



'p 



a p (E)=p+l-\E(F p )\. 
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This definition is the same as before because the reduced curve contains the 
infinite point Og. That is, subtracting the number of F p -points on E from p+ 1 
is the same as subtracting the solution-count working in Z/pZ from p. The 
reason to carry out either subtraction is that as x varies through F p we expect 
the Weierstrass equation modulo p to have two solutions (x, y) for roughly half 
of the values of x modulo p and no solutions for_the other x- values, giving 
a total of p points. Thus the estimated value of |_E(F P )| is p+ 1, and a p (E) 
measures how the actual value differs from its estimate. The next proposition 
expresses a p (E) in terms of the Frobcnius map, cf. the introductory remarks 
to this chapter. 

Proposition 8.3.2. Let E be an elliptic curve over Q and let p be a prime 
such that E has good reduction modulo p. Let <t P; * and a* be the forward and 
reverse maps ofPic°(E) induced by a p . Then 

a p (E) — <7 P; * + a* as endomorphisms ofPic°(E). 
(Here the left side means multiplication by a p (E).) 
Proof. An clement x e F p satisfies x v = x if and only if x € F p . Thus 

E(F p ) = {PeE:P a »=P} = ker(> p - 1), 

and so since a p — 1 is separable, 

\E(F p )\ = | ker(a p - 1)| = deg(a p - 1). 

The general formula h* o h* = deg(h) was given for characteristic 0 in Sec- 
tion 7.3, and we assert without proof that it holds in characteristic p as well. 
It is clear that the induced forward map of a sum of isogenics is the sum of the 
induced forward maps. The same fact for reverse maps is not obvious, but it 
was shown near the end of Section 6.2 in the complex analytic setting and we 
now assert without proof that it holds in characteristic p as well. Therefore, 
computing with endomorphisms of Pic (E), 

\E(F P )\ - deg(<7 p - 1) = (<7 P - 1), o (<j p - 1)* 

= Op,* - 1*) o O p - 1*) = V + 1 - - a*. 

The proposition follows by definition of a p (E). □ 

If E is an elliptic curve over Q and p is prime then Definition 8.3.1 extends 

to 

a r (E) =p e + l- \E{F p e)\, e>l, 
and Proposition 8.3.2 extends to 

a p e(E) = a p e^ + <7* e , e>l. 
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Further defining a\(E) = 1, these solution-counts of an elliptic curve sat- 
isfy the same recurrence as the coefficients a p t (/) of a normalized eigenform 
in S2(r 0 (N)) as in Proposition 5.8.5, 

a r (E) = a p (E)a p e-i(E) - l E (p)pa p .-2(E) for all e > 2. (8.21) 

Here 1_e is the trivial character modulo the algebraic conductor Ne of E, so 
1e(p) is 1 for primes of good reduction and 0 for primes of bad reduction. 
The recurrence further indicates the close relation between elliptic curves and 
cigcnforms. We will prove it in Exercise 9.4.2. The consequence that we want 
here is (Exercise 8.3.5) 

Proposition 8.3.3. Let E be an elliptic curve over Q, and let p be a prime 
such that E has good reduction at p. Then the reduction is 

ordinary if a p (E) ^ 0 (modp), 

supersingular if a p (E) = 0 (mod p). 

Exercise 8.3.6 provides some examples of the ideas in this section. 
Exercises 

8.3.1. Let p > 3 be prime. Let £ be a Weierstrass equation over Q with 
integral coefficients, and suppose that v p (A) > 12 and v p (c^) > 4. Show 
that v p (cq) > 6 as well. The admissible changes of variable from Section 8.1 
put E into the form (8.3) and multiply its discriminant by 6 12 , so that 
^ p (Zi(8.3)) = v p (A). Show that from here the admissible change of vari- 
able (x,y) = (p 2 x' ,p 3 y') gives an equation E' in integral form such that 
v p (A') = v p (A) — 12. Thus E is not in global minimal form. (A hint for this 
exercise is at the end of the book.) 

8.3.2. Let E be a Weierstrass equation (8.1) with a, e Z and let A be its 
discriminant. For primes p \ A the discriminant is already minimal at p. For 
each prime p \ A an admissible change of variable 

(x, y) = (u\x' + r p , uly' + s p u 2 p x' + t p ), u p , r p , s p , t p e Q 

transforms Btoa new Weierstrass equation E' p with coefficients a! i p € Z and 
discriminant A p = A/u p 2 minimal at p. 

(a) Explain why v p {u p ) > 0 and similarly for r p , s p , and t p . (Hints for this 
exercise are at the end of the book.) 

(b) Let u — Y\ p p UplyUp " 1 . Show that any admissible change of variable 

(x, y) = (u 2 x' + r, u 3 y' + su 2 x' + t) 



gives a Weierstrass equation E' with discriminant A' = ±A- 
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(c) Show that r, s, and t can be chosen to make a- e Z as follows. The 
results from Exercise 8.1.1(b) give 





= u p a' lp + 2(s - 


- s p ), 




u 2 a' 2 


= u l a '2, P + 3(r - 


- r p ) - (ai + s + s p )(s - s p ), 




u 3 a 3 




-r p )+2{t-t p ), 




u 4 a' 4 


= Upa' ip - a 3 (s 


- s p ) + 2a 2 (r - r p ) - ai(rs - 


TpSp ~\~ t 




+ 3(r 2 -r 2 p 


) 2(st Sptp), 




u 6 a' 6 


= u l a e. P + a 4 r 


- r p ) - a 3 (i - t p ) + a 2 (r 2 - r\ 


*)-ai(rf 




+ (r 3 4) 


-if-tl). 





From the first formula, 

Vpia'i) + v p {u) > mm{vp(a' l p ) + v p (u p ), v p {2) + v p (s - s p )}, 

and since v p (u) = v p (u p ) and Vp{a' l p ) > 0 this shows that fp(ai) > 0 if 
i>p(s — s p ) > v p (u). Working similarly with the other equations, show that the 
condition 

min{z/ p (r - r p ),v p (s - s p ), v p (t - t p )} > &v p (u) 

makes the coefficients a- integral. Apply the Chinese Remainder Theorem to 
show that there are integers r, s, and t satisfying the condition for all primes p. 

8.3.3. Let E and E' be two global minimal Weierstrass equations, and suppose 
an admissible change of variable x = u 2 x' + r, y = u 3 y' + su 2 x' + t takes E 
to E'. 

(a) Explain why u = ±1. 

(b) Use the formulas from Exercise 8.1.1(b) relating the coefficients b[ 
and bi to show that r G Z. (Hints for this exercise are at the end of the book.) 

(c) Use the formulas from Exercise 8.1.1(b) relating the coefficients d i 
and ai to show that s e Z and that t £ Z. Thus the change of variable 
reduces modulo p for all primes p. 

8.3.4. Consider an affinc algebraic curve over Q, 

Q : x 2 - dy 2 = 1, d e Z, 0. 
For any prime p not dividing 2d define 

a p (Q)=p-\Q(Fp)\. 

Show that the maps 

(l + dt 2 2t \ . . 
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fay) (-1,0) ^ oo 

x + 1 

define a bijection between P 1 (F p ) — {t : dt 2 = 1} and Q(F p ). Use this to show 
that a p (Q) is the Legendre symbol (d/p). As explained in the preface, the 
values a p (Q) therefore arise as a system of eigenvalues. 

8.3.5. Use the recurrence (8.21) to prove Proposition 8.3.3. (A hint for this 
exercise is at the end of the book.) 

8.3.6. (a) Consider the elliptic curve E : y 2 = x 3 - 1 over Q. Show that E 
is in global minimal form and has good reduction at all p > 3. What is the 
reduction type of E at p — 2? p — 3? (Hints for this exercise are at the end 
of the book.) 

(b) Let p be prime, p = 2 (mod 3). Show that the map x i— > x 3 is an 
automorphism of F* , and therefore the map x n- x 3 — 1 bijects F p to itself. Use 

this to count that |£?(F p )| = p + 1, i.e., a p (E) = 0. Thus E has supersingular 
reduction at p by Proposition 8.3.3. 

(c) More generally, let p be an odd prime and let E : y 2 — f(x) be 
an elliptic curve over F p . Recall from elementary number theory that the 
multiplicative group F* is cyclic. For each point x € F p show that the number 
of points (x,y) <E E(F p ) is 1 + /(x)^- 1 )/ 2 , interpreting /(a;)^" 1 )/ 2 e F p as 
one of —1,0, 1 € Z. Show that therefore 

\E(F p )\ = l+p+ /(z) (p - 1)/2 = l+£ ./-(*) (p - 1)/2 (modp). 
Show that for any nonnegative integer i, 




0 ifi = 0orz^0 (mod p — 1), 
— 1 if i > 0 and i = 0 (mod p — 1). 



Therefore the sum needs to be taken only over the terms of f{x) lyP ^ 1 ^ 2 whose 
power of a; is a positive multiple of p — 1. Show that the only such is the x v ~ x 
term. 

(d) Returning to the curve E : y 2 = x 3 — 1, let p > 3, p = 1 (mod 3). Use 
part (c) to show that 

\E(F p )\ ee 1 - (-1) (J, - 1)/6 (^I ^3) * 1 (modp), 

and consequently a p (E) == 0 (mod p). Thus -E 1 has ordinary reduction at p by 
Proposition 8.3.3. 

(e) Similarly, show that the curve E : y 2 — x 3 — x is in global minimal 
form with bad reduction only at p = 2, and determine whether this reduction 
is multiplicative or additive. Use part (c) to describe the good reduction type 
of E at each prime p > 2 in terms of p (mod 4) . 
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8.3.7. Show that the Weierstrass equation E : y 2 + xy = x z + x 2 + 44x + 55 
is in global minimal form. What are the primes of good, multiplicative, and 
additive reduction? 



8.4 Elliptic curves over Q and their reductions 

This section discusses reducing an elliptic curve over Q modulo a maximal 
ideal p of Z, where Q is the field of algebraic numbers and Z is the ring of 
algebraic integers. 

We begin with a brief discussion of the maximal ideal p itself. Given such 
an ideal, the intersection pn Z is a maximal ideal of Z, so it takes the form pZ 
for some rational prime p and we say that p lies over p. Conversely, given 
a rational prime p, the set pZ is an ideal of Z, and so by Zorn's Lemma 
from set theory there exists a maximal ideal p containing it. More specifically 
to this situation, algebraic number theory says that for each number field 
K C Q with its ring Ok of algebraic integers, the ideal pOk of Ok factors 
into a nonempty finite product of positive powers of the maximal ideals of Ok 
containing it, 

3k 

K>k = IKk- (8-22) 

i=i 

(A simple example of this was given in Section 3.7, the factorization of primes p 
in the number ring Z[/x 6 ]-) Since Q = 1J K and Z = U K Ok, a maximal ideal p 
of Z containing pZ is a corresponding union of maximal ideals over all number 
fields, 

p = |JpK. ( 8 -23) 

K 

Conversely, such a union is a maximal ideal of Z if and only if the number 
field ideals arc chosen compatibly, meaning that if K C K' then pK C Pk' • 
In sum, every maximal ideal p of Z lies over a unique rational prime p, and 
given a rational prime p, every compatible union p of the form (8.23) is a 
maximal ideal of Z lying over p. From now on, if the symbol p is in use then 
p is understood to denote its underlying rational prime. 
The localization of Z at p is 

z (p) = i x /y -x,yeZ, y <£ p}, 

a subring of Q since maximal ideals are prime. The ideal pZ( p ) of Z( p ) is 
its unique maximal ideal since its complement is the set Z^ of units, mak- 
ing the quotient Z( p )/pZ( p ) a field. There is a natural isomorphism of fields 
(Exercise 8.4.1(a)) 

Z/p Z (p) /pZ (p) , a + p^ a + pZ (p) . 
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Also, Z/p is an algebraic closure of Z/pZ (Exercise 8.4.1(b)). So we view 
Z(p)/pZ( p ) as F p , giving the natural surjection with kernel pZ( p ) 

~:Z (p) — >Fp, 5 = a + pZ (p) . (8.24) 

Since pZ( p ) n Z = pZ (Exercise 8.4.1(c)), the reduction map (8.24) extends 
the earlier reduction map Z — > F p of (8.20). 

Lemma 8.4.1. Let p be a maximal ideal of Z. Let a be a nonzero element 
o/Q. Then at least one of a or 1/a belongs to Z( p ). 

Proof. Consider the number field K = Q(a). Let Ok be its ring of algebraic 
integers and let p K = p n Ok- The localization of Ok at p K is 

Ck,( P ) = {x/y : x,y e Ok, y £ Pk}, 

a subring of K containing Ok and with unique maximal ideal PkOk.(p) ■ There 
is a valuation 

z/p K : K — > ZU {+oo}, 

a surjection taking each nonzero x € Ok.(p) to the largest d e N such that 
a; G Pk^k,(p)- Constructing this number field valuation uses the facts that the 
ideal PkOk.(p) 18 principal and that the ring Ok.(p) is Noetherian, analogously 
to how we constructed the function field valuation at a nonsingular point of 
an algebraic curve in Section 7.2. Then v Vvi — ev v on Q, where e e Z+ is 
the power of pK occurring in the factorization (8.22), and v p is the p-adic 
valuation from the previous section. The valuation v = f PK has the standard 
properties, 

u{rr') = v(r) + v(r'), 
v(r + r') > min{z^(r), v(r')} with equality if v(r) ^ v{r'). 

With the valuation in place the lemma is clear: if v{a) > 0 then a e Ok.(p) C 
Z ( p), and if v{a) < 0 then v(\/a) > 0. □ 

The proof of Lemma 8.4.1 used valuation theory in a number field K 
because there is no corresponding valuation v — v 9 on Q. But the lemma is 
set in Q, and it enables us to work systematically over Q from now on. 

To reduce an elliptic curve over Q modulo p, consider any Weierstrass 
equation E over Q. By Exercise 6.4.4 every algebraic number takes the form 
of an algebraic integer divided by a rational integer. It follows that an ad- 
missible change of variable (x,y) = (u 2 x' ,u 3 y') with 1/u € Z + transforms E 
to a Weierstrass equation whose coefficients a[ = aij u l lie in Z and therefore 
in Z( P ). Thus we may assume that the Weierstrass equation E is p-integral to 
begin with, meaning the coefficients are in Z( p ), 



E : y 2 + aixy + a 3 y = x 3 + a 2 x 2 + a^x + a 6 , Oi, . . . , a 6 <E Z( p ). 
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Any p-integral Weierstrass equation reduces via (8.24) to a Weierstrass equa- 
tion over F p , 

E : y 2 + aixy + a 3 y = x 3 + d 2 x 2 + a 4 x + a 6 , a 1 , . . . , a 6 G F p . 

Proposition 8.1.3 and other results from Section 8.1 combine to show that 

• E defines an elliptic curve over F p if and only if A ^ 0, in which case 
the curve either is ordinary with E\p] = Z/pZ, or supersingular with 

• E defines a singular curve over F p with a node if and only if A = 0 
and c 4 7^ 0, in which case the rest of E forms a multiplicative group 
isomorphic to F p , 

• E defines a singular curve over F p with a cusp if and only if A = 0 
and C4 = 0, in which case the rest of E forms an additive group isomorphic 
to F p . 

Thus we can speak of the ordinary supersingular, multiplicative, or additive 
reduction type of any p-integral Weierstrass equation. 

We want the reduction type to make sense for Q-isomorphism classes of 
elliptic curves over Q, that is, for Q-isomorphism classes of nonsingular Weier- 
strass equations over Q. An admissible change of variable over Q puts any 
such equation into p-integral Legendre form (Exercise 8.4.2(a,b)), 

E:y 2 = x{x-l){x-X), A $ {0, 1}, AeZ (p) . 

Formulas from Section 8.1 specialize for Legendre form to (Exercise 8.4.2(c)) 

A = 16A 2 (1 - A) 2 , C4 = 16(l-A(l-A)). 

Assume now that p does not lie over the rational prime 2. Then the formulas 
for A and C4 give the implications (using that maximal ideals are prime for 
the first one) 

A e pZ (p) => A(l - A) e pZ (p) => c 4 i pZ (p) . 

That is, the conditions for additive reduction, A = £4 = Op , are incompatible 
with p-integral Legendre form. Therefore every elliptic curve over Q has a 
p-integral Weierstrass equation with good or multiplicative reduction when p 
does not lie over 2. This result holds as well when p does lie over 2, by an 
argument using the Deuring form of a Weierstrass equation — see Appendix A 
of[Sil86]. 

A p-integral Weierstrass equation with good or multiplicative reduction is 
called p -minimal. Additive reduction being the worst type, we avoid it, as we 
now know we can. Consider only p-minimal Weierstrass equations, and define 
two such equations to be equivalent when they differ by an admissible change 
of variable over Q. 
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Proposition 8.4.2. Ordinary reduction, super singular reduction, and multi- 
plicative reduction are well defined on equivalence classes oj p -minimal Weier- 
strass equations. If E and E' are equivalent p-minimal Weierstrass equations 
with good reduction at p then their reductions define isomorphic elliptic curves 
over F p . 

Proof. If E and E' are equivalent then by Exercise 8.1.1(b) 

u 12 A' = A, u 4 c' 4 = c 4 , 

where u comes from the admissible change of variable taking E to E' . Recall 
the disjoint union mentioned early in this section, 

Z( P ) = Z( p) UpZ (p) . 

Assume A' e ^(p)- If a l so ^ <= pZ( P ) then u 12 e pZ( P ) and thus u 4 e pZ( p ) so 
that C4 G pZ( p ). This means that E has additive reduction, impossible since 
E is p-minimal. Thus there is no equivalence between equations of good and 
multiplicative reduction. 

Given a change of variable between two equations of good reduction, we 
may further assume by Lemma 8.4.1 that u G Z( p ), and so the relation u 12 A' = 

A shows that u € Z( p N. Therefore u ^ 0 in F p . Also the other coefficients r, s, t 
from the admissible change of variable taking E to E' lie in Z( p ), similarly to 
Exercise 8.3.3(b,c) (Exercise 8.4.3), so they reduce to F p under (8.24). The 
two reduced Weierstrass equations differ by the reduced change of variable, 
giving the last statement of the proposition. Since isomorphic elliptic curves 
have the same p-torsion structure, this shows that ordinary and supersingular 
reduction are preserved under equivalence. □ 

The proposition shows that if E is an elliptic curve over Q then its reduc- 
tion type at p is well defined as the ordinary, supersingular, or multiplicative 
reduction type of any p-minimal Weierstrass equation for E. Furthermore, the 
proposition shows that if the reduction is good then it gives a well defined 
elliptic curve E over F p up to isomorphism over F p . 

The results of this section explain some earlier terminology Any elliptic 
curve E over Q can be viewed instead as a curve Eq over Q. Let p e Z be 
prime and let p C Z be a maximal ideal lying over p. We have shown that 
ordinary, supersingular, and multiplicative reduction of E at p do not change 
upon reducing Eq at p instead, but additive reduction of E at p improves 
to good or multiplicative reduction of Eq at p. This motivates the words 
"semistable" for good or multiplicative reduction and "unstable" for additive 
reduction, as given in the previous section working over Q. Exercise 8.4.4 gives 
an example of an elliptic curve over Q whose additive reduction at a rational 
prime p improves in Q. 

Good reduction is characterized by the j-invariant of an elliptic curve, 
something we can check without needing to put the Weierstrass equation into 
p-minimal form or even into p-integral form. 
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Proposition 8.4.3. Let E be an elliptic curve over Q and let p be a maximal 
ideal of Z. Then E has good reduction at p if and only if j(E) £ Z( p ). 

Proof. Suppose E has good reduction at p. We may assume the Weierstrass 
equation for E is p-minimal. Thus A £ and c 4 £ Z( p ), so that j(E) = 
c\/A € Z( p ) as desired. 

For the other direction, assume the Weierstrass equation of E is in p- 
integral Legendre form, and again assume that p does not lie over 2. Since 
A = 16A 2 (1 — A) 2 , showing that E has good reduction at p reduces to showing 
that A(l— A) ^ pZ( p ). But since also C4 = 16(1— A(l— A)), the relation j = c\jA 
is 

jA 2 (l-A) 2 = 16 2 (l-A(l-A)) 3 . 

The assumptions are that j £ Z( p ) and p does not lie over 2. Thus if A(l— A) lies 
in pZ( p ) then so does the left side, while the right side does not, contradiction. 
For the Deuring form proof when p does lie over 2, again see Appendix A 
of[Sil86]. □ 



So far we have reduced a Weierstrass equation, but not the points them- 
selves of the elliptic curve. To reduce the points, we first show more generally 
that for any positive integer n the maximal ideal p determines a reduction 
map 

~ : P"(Q) — > P n (F p ). (8.25) 

To see this, take any point 

P = [ % ..,i„]eP"(Q), 

and consider the values m £ N such that P has a representative with all of 
xq, . . . ,x m in Z and at least one of them equal to 1. Such values of m exist, 
e.g., the smallest m such that x m ^ 0. And given such an m that is less 
than n, if x m+ i £ Z( p ) then m + 1 also works, but otherwise \/x, m+ i £ Z( p ) 
by Lemma 8.4.1 and so multiplying P through by l/x m+ i shows that again 
m+ 1 works. Thus P has a representation such that all coordinates lie in Z( p ) 
and some Xi is 1. This representation reduces to 

P=[x 0 ,...,x n ] eP"(F p ). 

— * 

The scalar quotient of two such representations of P must belong to and 
thus reduce to F p , so the two representations reduce to the same element 
of P"(Fp) and the reduction map is well defined. The description of the map 
shows that an affine point P = (x\, . . . , x n ) = [1, x\, . . . , x n ] of P"(Q) reduces 
to an affine point of P"(F p ) if and only if all of its coordinates lie in Z( p ). 

In particular, if E is an elliptic curve E over Q or over Q and E is its 
reduction at p or at p then the points of E, a subset of P 2 (Q), reduce to 
points of E since 
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E(x,y)=E(^)=6^ = 0 % . 

Since the only nonaffine point of any elliptic curve is its zero, the end of the 
previous paragraph shows that reduction from E to E has for its kernel the 
affine points whose coordinates are not both p-integral, 

ker(~) = J E-Zj p) . (8.26) 

Let E be an elliptic curve over Q and let N be a positive integer. To 
study the reduction of iV-torsion at a maximal ideal p, recall the iVth division 
polynomial ?pN from Section 7.1, satisfying 

[N]{P)=0 E ^(P)=0, ^ N eZ[g 2 ,g 3 ,x,y}. 

An elementary argument (see [Sil86]) shows that for Weierstrass equations 
y 2 = x 3 — g 2 x — 33, i.e., for Weierstrass equations in the form (8.3), ipN lies 
in Z[g 2 , 33, x, y 2 ] if N is odd and lies in yZ[g 2l 33, x 7 y 2 ] if N is even. In either 
case ipjj can be viewed as an element of Z[g 2 , 33, x] after using the Weierstrass 
relation to eliminate y, and then it takes the form 

^ 2 N (x) = N 2 x N2 - 1 + ---€Z[g 2 ,g 3 ,x]. 

Define the localization of Z at p, 

Z (p ) = {x/y :x,yeZ, y <£ pZ} = {r E Q : v p (r) > 0}. 

For p \ N, dividing 1/^(2;) 2 through by A'' 2 gives a monic polynomial that is 
still p-integral, 

ip 2 N {x)/N 2 - x" 2 - 1 + ■ e Z {p) [g 2 ,g 3 ,x}. 

Suppose the Weierstrass equation of E is in p-minimal form, and suppose 
that p does not lie over 2 or 3. The admissible changes of variable at the 
beginning of Section 8.1 to reduce the general Weierstrass equation (8.1) to 
the form (8.3) leave E in p-integral form. Further suppose that p \ N . Then 
the monic p-integral polynomial condition on the x-coordinates of E[N] shows 
that they are p-integral (Exercise 8.4.5(a)). Consequently the y-coordinates 
of E[N] are p-integral as well (Exercise 8.4.5(b)). Thus (8.26) shows that the 
reduction E[N] — > E[N] is injective, and it is therefore surjective since E[N] 
and E[N] are isomorphic finite groups. 

Proposition 8.4.4. Let E be an elliptic curve over Q with good reduction 
at p. Then: 

(a) The reduction map on N -torsion, 

E[N] — > E[N], 



is surjective for all N. 
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(b) Any isogenous image E/C where C is a cyclic subgroup of order p also has 
good reduction at p. Furthermore, if E has ordinary reduction at p then so 
does E/C, and if E has supersingular reduction at p then so does E/C. 

We have shown part (a) when p \ 6N, but a complete proof is beyond 
the scope of this book. The substance of part (b) is that good reduction 
remains good under the isogeny, also not proved here. Granting this, it follows 
that ordinary reduction remains ordinary and supersingular reduction remains 
supersingular. Let <p : E — > E' be the isogeny from E to E' = E/C and let ip : 
E' — > E be its dual, so also ip is the dual of ip. The pending Theorem 8.5.10 
says that there exist reduced maps 

(p : E — $ :E> — > E 

such that ip o tp — [N]^ and <p o ip = [N]-^,. Thus the map <p o tp o ip has two 
descriptions, 

[p\w °^^ 0 bli :£ — > E ' ■ 

The descriptions combine to show that deg sep ([p]) is the same on E and on E' . 
That is, \E\p]\ — \E'\p]\ since deg sep = |ker | for isogenics, cf. (8.13). 

Exercises 

8.4.1. (a) Show that there is a natural isomorphism Z/p Z/ p )/pZ( p ). 

(b) Show that Z/p is an algebraic closure of Z/pZ as follows. First show 
that any (3 € Z/p is algebraic over Z/pZ. Second use the fact that any monic 
polynomial g £ (Z/pZ)[x] takes the form g = f where / e Z[x], to complete 
the argument. (A hint for this exercise is at the end of the book.) 

(c) Show that pZ (p) C\Z=pZ. 

8.4.2. (a) Show that the general Weierstrass equation over Q is taken by an 
admissible change of variable over Q to Legendre form y 2 = x(x — l)(x — A) 
where Ae Q-{0,1}. 

(b) Change this to y 2 = x(x — l)(x — 1/A) by another admissible change 
of variable over Q. Why may we thus assume that A G Z( p )? 

(c) Confirm the formulas for A and C4 arising from Legendre form. 

8.4.3. Show that any admissible change of variable over Q between two p- 
integral Weierstrass equations of good reduction entails r, s,t e Z( p y (A hint 
for this exercise is at the end of the book.) 

8.4.4. (a) Consider the Weierstrass equation E : y 2 — x 3 — 9x. Show that it 
is in reduced form over Q and that it defines an elliptic curve over Q with 
additive reduction at p = 3. 

(b) Consider the number field K = Q(/i6) where /j>q — e 27 ™/ 6 , satisfying 
the polynomial x 2 — x + 1 . Its ring of integers is 
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O k = Z[/i 6 ] = {a + bfi 6 : a, b e Z}, 

denoted A in Section 3.7. From that section, the factorization of 3 in Ok is 
30 K = p K where p K = (1 + Me) - Let tt = 1 + ^. Show that 3 = /i|?r 2 in 0 K - 
(c) Now view E as an elliptic curve over K. Apply the admissible change 
of variable over K 

What is the reduction type of E at pK? What is the reduction type of E as 
an elliptic curve over Q at any maximal ideal p lying over 3? (A hint for this 
exercise is at the end of the book.) 

8.4.5. (a) Let E be a p-minimal Wcierstrass equation over Q in the form (8.3). 
Let p lie over p where jj ^ 2,3. Let N be a positive integer such that p \ N. 
Show that the x-coordinates of E[N] are p-integral. 

(b) Show that the y-coordinates of E[N] are p-integral as well. 

8.5 Reduction of algebraic curves and maps 

Reducing Weierstrass equations and the points of elliptic curves from charac- 
teristic 0 to characteristic p in the previous two sections was fairly straight- 
forward. But already the results of Proposition 8.4.4, that good reduction 
surjects as a map of TV-torsion and is preserved under isogeny, are beyond 
the scope of this book. Even granting that good reduction is preserved, the 
argument that consequently so are its ordinary and supersingular subtypes 
still needed to quote facts about reducing isogenics to characteristic p. The 
subject of the next section, reducing modular curves from characteristic 0 to 
characteristic p, is much more substantial than any of this. The present section 
states the results to be quoted from algebraic geometry, incidentally justify- 
ing a few unsupported assertions from earlier. The statements are given in 
elementary terms, making them cumbersome in places but perhaps accessible 
to more readers than the scheme-theoretic language of modern algebraic ge- 
ometry. The results Theorem 8.5.7 and Theorem 8.5.9 not proven here follow 
from Theorem 9.5.1 of [BLR90] . 

Recall the localization of Z at p, 

z (p) = i x /y -x,yeZ, y<£ pZ}, 

a subring of Q containing Z and having unique maximal ideal pZ^ p y This 
construction eliminates all primes but p from the ideal structure of Z( p ) . There 
is a natural isomorphism 

Z/pZ ^ Z {p) /pZ [p) , a+pZ ^ a + pZ {p) , 

so that the reduction map 
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: Z (p ) — >F p , a = a + pZ( p) 

is a well denned surjection with kernel pZ( p ). The definition of reducing an 
algebraic curve over Q to an algebraic curve over F p uses the localization Z( p ). 

Definition 8.5.1. Let C be a nonsingular affine algebraic curve over Q, de- 
fined by polynomials ifi, . . . ,(p m € Z^[xi, . . . ,x n ]. Then C has good reduc- 
tion modulo p (or at p) if 

(1) The ideal I = (<pi, . . . ,(p m ) °/Z( p )[xi, . . . , x n ] is prime. 

(2) The reduced polynomials <pi,..., (p m G F p [xi, . . . , x n ] define a nonsingular 
affine algebraic curve C over F p . 

In this case C is the reduction of C at p. 

Without condition (1) the curve and its reduction need have nothing to 
do with each other. For instance, consider the nonprime ideal 

1= {pipy- !),(</- x 2 )(py - 1)) cZ {p) [x,y}. 

The ideal Iq C Q[x, y] with the same generators as I is {py — 1), defining the 
curve y = 1/p in Q 2 , a translate of the z-axis; but the reduction I C F p [x,j/] 
is {y — x 2 }, defining a parabola in F . See Exercise 8.5.1 for a generalization 
of this example. 

An observation is required before extending Definition 8.5.1 from affine 
curves to projective curves. Let k be any field. Introduce the notations x — 
(xo,-.-,x n ) and x^ = {x 0 , ■ ■ ■ , 1, x i+ i, . . . , x n ) for i = 0, ...,n, and 
= <Pi x (i)) f° r <P e k[a;]. For any homogeneous ideal 

J = (M) ck[x], 

its ith dehomogenization for i = 0, . . . , n is 
and its ith rehomogenization is 

^(j),hom = ({<P(»),hom}) C k[x], 

where ^(^^hom means to multiply each term of (p^ by the smallest power of Xi 
needed to make all the terms have the same total degree. Thus ip = xfip^y hom 
for some e > 0. 

Consider the ideal I = {x 0 xi + x n x 2 ). Its 0th dehomogenization is 7( 0 ) = 
(xi + x 2 ), a prime ideal that defines an algebraic curve, but 7(o),hom — lo I- 
On the other hand, 1^ = {x 0 (l + x 2 )) is not a prime ideal, but /(i) ; hom = I- 
This example shows that whether a dehomogenization defines a curve and 
whether it rehomogenizes back to the original ideal arc not equivalent. How- 
ever, 



338 



8 The Eichler-Shimura Relation and L-functions 



Lemma 8.5.2. Letk be afield. Let I C k[x] be the homogenization of a prime 
ideal C k[x/o)] that defines an affine algebraic curve. Then I is prime. For 
i = 0, ... ,n, if /(i) 7^ k[a;(j)] then 1^ is prime, I^^om — I> an d defines 
an affine algebraic curve. 

Proof. If tpip e /, where ip,ip € k[x], then V(o)' I / , (o) — (W0(o) <= /(o) and so (up 
to symmetry) <p (0) e J (0 ), giving (/? (0 ),hom € J. Since </?( 0 ),hom I also p € I. 
This shows that / is prime. 

If ipip e /(j), where p,r/> € k[x (i) ], then (^homV'hom = (^)hom € Thus 
(up to symmetry) <phom € / since I is prime, and consequently ip G since 
9 s = ^hom^i)- This shows that Iu\ is prime unless it is all of k[x(j)], not 
considered a prime ideal since the corresponding trivial quotient is not an 
integral domain. 

As already mentioned, each tp £ I takes the form ip = i^j^hom for some 
nonnegative integer e. On the one hand this gives the inclusion I C /(»vhom- 
On the other hand, if ^ k[x(j)] then Xj £ I, so ipu\hom <= I since / is 
prime. Thus if 1^ ^ k[x(j)] then /(i).hom C /, and this combines with the first 
inclusion to show that Iu),hom = I- 

If /(j) 7^ k[x(j)] then the field of quotients of k[x(j)]/J(,) can be identified 
with a subficld of the field of quotients of k[x]/7 that has transcendence de- 
gree 1 (Exercise 8.5.2). This shows that Iu\ defines a curve. □ 

The lemma supports the assertion in Section 7.2 that the function field of 
any nonempty affine piece of a projective curve is the function field of the entire 
curve, regardless of which piece is chosen. Examples such as (xoXi + X0X2) 
above, with extra powers of a variable, don't arise from homogenizing a prime 
affine ideal. 

Definition 8.5.3. Let Ch om be a nonsingular projective curve over Q defined 
by the homogenization I C Z( p )[x] of a prime ideal Lq\ C Z( p )[x(o)] as in 
Definition 8.5.1. Then Ch om has good reduction at p if for i = l,...,n, 
either the affine curve Ci defined by 1^ has good reduction at p or 1^ reduces 
to all o/F p [x(i)] so that Ci has empty reduction at p. The curve Chom defined 
by the homogenization (/(o))hom C F p [x] is the reduction of C at p. 

The reduced curve Ch om is defined by any nonempty reduction d of an 
affine piece of Chom, but this is not immediately obvious. Since dehomogeniz- 
ing and reducing commute, 

(-f(t))hom = ^(i),hom! 1 = 0, ...,U. 

Let J — /(o),hom- Then I C J and therefore J(j) C J(j) for each i, with J( 0 ) = 
/(o) i n particular. Since the reduced curve is defined by the homogenization 
of J(o), Lemma 8.5.2 says that it is defined by any that is not all of F p [x(j)]. 
But each such Ju\ is in fact Iu) since Ju) / Iu) is a prime ideal of the coordinate 
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ring F p [Cj] = F p [a;(j)]//(i). As such it is the zero ideal or the maximal ideal top 
of functions vanishing at some point P G Cj (see Exercise 8.5.3). But in the 
latter case the quotient field of the integral domain F p [xu^]/Jr^ does not 
have transcendence degree 1, and this is a contradiction since defines a 
curve. Therefore J^/I^ — {0}, i.e., Ju-s = Iu\ when Ju\ ^ F p \xn\[. After 
the pending Theorem 8.5.4 we will be able to show that if on the other hand 
J(i) = F p [a;(j)] then also 1^ = F p [x(j)], completing the argument that Chom 
is defined by any nonempty d. As in Section 7.2 we now drop the notation 
Chom to distinguish a projective curve from its affine pieces. 

Good reduction at p on one affine piece of a projective curve does not 
guarantee good reduction of the curve (Exercise 8.5.4). Also, just as a Weier- 
strass equation over Q needs to be put into p-minimal form by an isomorphism 
over Q in order to optimize its reduction type at p, the same can apply to a 
nonelliptic curve over Q. For example, the circle x 2 + y 2 = z 2 has bad reduc- 
tion at p = 2 since every point is singular, but it is isomorphic over Q to the 
line defined by no equations in one variable (Exercise 8.5.5), and this clearly 
has good reduction at all p. 

Curves as defined by equations are now reduced to characteristic p. To 
reduce the points of a curve, recall the reduction map (8.25) of projective 
space, 

: P"(Q) — ► P n (F p ). 

If C is a nonsingular projective algebraic curve with good reduction at p then 
this takes C to C. 

Theorem 8.5.4. Let C be a nonsingular projective algebraic curve over Q 
with good reduction at p. Then the reduction map C — > C is surjective. 

The idea of the proof is to argue ring theoretically, proceeding from a 
point in characteristic p to a point in characteristic 0 via ideals. The points 
of an algebraic curve C over any field k map to the maximal ideals of its 
coordinate ring k[C], each point Q mapping to the maximal ideal toq of 
functions that vanish at Q. The map surjects by Hubert's Nullstcllcnsatz, 
mentioned in Exercise 8.5.3. It injects if k is algebraically closed, and more 
generally toq = toq/ if and only if some automorphism of k over k takes Q 
to Q' by acting componentwise. Since C is defined by equations over k it is 
closed under this action. 

Proof. Let Q' be a point of C. Then Q' lies in some affine piece Cj of C. 
Since i is fixed we will simply call the affine piece C throughout the proof. 
Thus the affine coordinate ring is F p [C] = F p [xii)]/Iuy Consider the ring 
Z( P )[C] = Z/p) The map Z( p JC] — > F p [C] is a surjection with 
kernel (p) — pZ^[C], so that Z( p )[C]/(p) is naturally identified with F p [(7]. 
Let m C Z( p ) [C] be the inverse image of the ideal toq< of F P [C] corresponding 
to Q' . Then to is a maximal ideal of Z( p )[C], and p £ m. So far we have 
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z (p)[C] — > f p[C]. m — > m Q >. 

Let / G m be a lift to Z( p )[C] of a nonzero function in mg<. Because 
/ + (p) is nonzero and hence not a zero-divisor in the integral domain F p [C] , 
it follows that p + (/) is not a zero-divisor in Z( p )[C]/(/). Indeed, if p + (/) 
is a zero-divisor then /g = ph for some h € Z( p )[C] with /i ^ (/), and thus 
.9 i (p) since Z( p JC] is an integral domain, making / + (p) a zero-divisor 
as well, contradiction. Let P C Z( p )[C] be the lift of a minimal prime ideal 
of Z( p )[C]/(/) (see Exercise 8.5.6(a)) contained in m/(f). Thus P is nonzero 
since / G P. A fact from commutative algebra, that any minimal prime ideal 
of a ring is a subset of the ring's zero-divisors (Exercise 8.5.6(b~c)), shows 
that p + (/) £ P/(/). Now wc have 

PCmCZ (p) [q, pi P. 

Consider the coordinate ring Q[C] = Q[#(i)]/-^(i),Q where I(i),Q is the ideal 
of Q[a;(j)] with the same generators as the ideal 1^ of Z( p )[a;(j)]. Then PQ[C] 
is a nonzero prime ideal of Q[C] (it is proper because p ^ P), and so by 
Exercise 8.5.3, 

PQ[C] = m,Q for some Q eC. 

The quotient Q[C]/mg is a field naturally identified with the values taken by 
polynomial functions on C at Q, a subfield of the Galois extension K of Q 
generated by the coordinates of Q. Also P = tuq 0 Z( p )[C] (the containment 
"C" is clear; for the other, if h <G itiq then e P for some iV, and if 
/i € Z( p )[C] as well then h e P since p ^ P), and so the quotient Z( p )[C]/P 
maps to K. Thus we have the map given by evaluating at Q, 

e Q :Z (p) [C]/P^K, g + P^g(Q). 

Let Ok be the ring of integers of K. Lemma 8.5.5 to follow will show that 
there is a maximal ideal qx of Ok lying over p such that eg(Z( p ) [C]/P) is 
contained in the local ring 0K.(q K ) an( ^ e Q( m /^ > ) ^ s contained in its maximal 
ideal (Ik^k^k)- This gives a map 

e Q : Z ( P )[C]/ TO — >• OK,(q K )/qKO K ,( qK ), g + m^ g(Q) + q K 0 K ,( qK ). 

The reduction map C — > C depends on an initial choice of a maximal 
ideal p of Z lying over p. Let pK — pHK. A result from algebraic number the- 
ory, to be discussed in Section 9.1, is that some clement a G Gal(K/Q) takes 
qK to pK- The quotient Ok,(p k )/PkOk,( Pk ) is a finite extension field of F p , 
isomorphic to Ok/Pk- (This generalizes the isomorphism between Z( p )/pZ( p ) 
and Z/pZ at the beginning of the section.) Another result from algebraic num- 
ber theory to be discussed in Section 9.1 is that every automorphism of this 
field is the reduction to characteristic p of an automorphism r G Gal(K/Q) 
such that p K = Pk ■ Replacing a by ctt for any such r gives a new a that still 
takes qK to pK- Composing the map eg with a gives 
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e Q : Z( p) [C]/to — > Ok,(pk)/PkOk,( P k)> 9 + m t-> ,g(Q) CT + P k £>k,(pk)- 

Note that g(Q) a = g(Q a ), so this last map can also be denoted £q^. In 
particular, specializing g to the coordinate functions Xj shows that Q a has 
p-integral coordinates. 

We now have a commutative diagram 

z (p)[C] >Ck,(pk) 



F P [C] >F p , 

where the top row is g i— > g(Q a ), the vertical arrows are reduction to char- 
acteristic p, and the bottom row is 5 1 — >■ g(Q a ) for g € Fp[C]- The kernel of 
the bottom row is thus m^p, but since the diagram commutes the kernel of 
the bottom row is also the reduction of m, i.e., tuq'. As explained before the 
proof this means that Q' = {Q a ) T for some automorphism r of F p . But this r 
restricts to an automorphism of Ok.(pk)/PkCk.(p k )' an( ^ so as explained in 
the previous paragraph we may assume that it is the reduction of an auto- 
morphism t e Gal(K/Q) that fixes pK- That is, we may assume that r has 
already been incorporated into a, giving the desired result Q' — Q a . □ 

The following lemma was used in the proof of Theorem 8.5.4. 

Lemma 8.5.5. Let p be prime. Suppose S is a Z( p ysubalgebra of a number 
field K and M is a maximal ideal of S containing p. Then there is a maximal 
ideal q of Ok lying over p such that S C 0K,(q) an d M C qC*K,( q ). 

Specifically, the lemma was applied with S = Z( p )[C]/P and M = m/P. 
The more experienced reader should supply details for the following proof, 
while the reader with less background can skim it. 

Proof. (Sketch.) Let S' be the localization of S at M. Then S C 5" and S' 
is local with maximal ideal M' containing M. Let 5"' = <S"0k- Then S" is 
finitely generated as an S"-module, so S"/M'S" is nonzero by Nakayama's 
Lemma and thus M' is contained in some maximal ideal M" of S". It suffices 
to prove the lemma with S" and M" in place of S and M. Now, S" contains 
the set 

Ck,(p) — { s *= K : f q (s) > 0 for all q over p}. 

This is the localization of Ok at the multiplicative set Z — (p) . It is a principal 
ideal domain with only finitely many primes, these being q0K,(p) f° r q over p. 
Showing this relies on the fact that for each q there exists a uniformizer that 
is a unit at the finitely many other q' over p. (We proved a similar result for 
function fields in Chapter 7.) Using these uniformizers we can also show that 
there is a subset Q of the set of q over p such that 
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S" = {s £ K : z/ q (s) > 0 for all q £ Q}. (8.27) 

Indeed, for any q such that there exists an s £ S" with f q (s) < 0 we can 
multiply through by suitable powers of uniformizers to get an s' £ S" with 
Vq(s') = —1 and Vq'{s') > 0 for other q', and then add the product of the 
uniformizers at q' such that ty (s') > 0 to get an s" £ S" such that v q >(s") = 0 
for q' 7^ q. Doing this for each such q we get that S" takes the form (8.27). 
Since S" has a nonzero maximal ideal and since the maximal ideals of S" are 
M q ' = {s £ S" : v q (s) > 0} for each q £ Q, the set Q is nonempty. Thus 5"' 
and M" satisfy the desired conclusions of the lemma. □ 

We can now complete the jirgument that the reduced curve Ch om is defined 
by any nonempty reduction Cj of an affine piece of Chom- Let p be a maximal 
ideal of Q lying over p, and let C^i denote the points of C whose ith coordinate 
can be normalized to 1 with the others p-integral. The proof of Theorem 8.5.4 
shows that the map Cj ; i — > C{ is surjective. The case remaining to be proved 
is when is all of F p [x(j)]. But then x™ £ J for some to, and so x™ £ 

/(o) = >^(o)i i- e -i x i € I(o) since // 0 ) is prime. Writing Xi = f + pg for some 
/ £ /(o) shows that for any Q = (ai, . . . , a n ) = [1, a±, . . . , a n ] in Co we have 
cti £ p'Zi(p), so Q ^ Ci.i. Therefore C^i is contained in C — Co, which is finite 
(Exercise 8.5.7(a)), therefore C is finite and thus empty (Exercise 8.5.7(b)), 
i.e., = Fp[x (i) ] = Jj. 

With curves and their points reduced, the next object is to reduce mor- 
phisms. Doing so requires the following result. 

Lemma 8.5.6 (Krull Intersection Theorem, special case). Let R be a 

Noetherian domain and let a £ R be a nonunit. Then H^lo ~ {*-*}• 

Proof. Let J — P|^!Lo a€ Rj ^ TO ij • • • ; m fe be a set of generators for J, and let 
to be the column vector that they form. Since a J = J (Exercise 8.5.8), also 
am is a column vector of generators, and so there exists a k-by-k matrix A 
with entries in R such that 

to = aAm. 

That is, (1 — aA)m = 0 where 1 is the fc-by-fc identity matrix. But det(l — aA) 
is 1 modulo aR and hence nonzero, making 1 — aA invcrtible over the quotient 
field of the integral domain R. This shows that to = 0, giving J = 0 as desired. 

□ 

Let 

be a morphism over Q of nonsingular projective algebraic curves over Q with 
good reduction at p. To reduce h to a morphism h of the reduced curves, 
take a representation h — [ho, ■ • • , h r ]. Let J( 0 ) C Z( p )[x( 0 )] be the ideal whose 
homogenization defines C. We may assume that each hi lies in the subring 
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R = Z( p )[x(o)]//(o) of the coordinate ring Q[Co] = Q[£(o)]/-^Q,(o)- Let the 
p-adic valuation of each hi be 

v p {h t ) = max{e : hi £ p e R} E N U {+00}, 

and note that at least one v v (hi) is finite by the lemma. Then the p-adic 
valuation of h is 

v p (h) = min{z/ p (/ij) : i = 0, . . . , r}, 

and ft, rewrites as 

h=[h' 0 ,...,h' r ], h'i= p~ v ^ h ^hi for i = 0, . . . ,r. 

Each entry h[ still lies in i? but now some entry lies in R — pR, giving it 
nonzero reduction in F p [Co] = R/pR. The reduction h = [h' 0 , . . . , h' r ] is a 
rational map from Co to C. To see this, note that each point of Co takes the 
form P where P G C 0 , and h(P) 7^ 0 for all but finitely many such points. If, 
say, h'iiP) 7^ 0 then we want to show that h(P) lies in C[. But any element 
of the defining ideal takes the form g where g G I'^y Thus for P, i, 
and g as in this discussion, g(h(P)) is the reduction of g(h(P)), and this is 0 
because h(P) lies in C[. The argument has shown that h{P) = h(P) for all 
P € C that reduce to the complement of a finite subset of C. This statement 
is independent of which afline piece of C and which representatives hi were 
used to define h, although the finitejset can change if these do. In sum, h is 
well defined as a rational map from C to C Finally, by the general theory of 
algebraic curves the rational map extends to the desired reduced morphism, 

h:C—¥C i . 

The genus of an algebraic curve was mentioned in Section 7.8. An algebraic 
curve C over Q has genus 0 if and only if it is isomorphic over Q to the 
projective line, or equivalently if and only if its function field Q(C) is Q- 
isomorphic to a field Q(t). Genus 0 curves are exceptional, but as long as the 
target curve C has positive genus, the morphism h and its reduction h act 
compatibly on points and have the same degree. 

Theorem 8.5.7. Let C and C be nonsingular projective algebraic curves 
over Q with good reduction at p, and let C have positive genus. For any 
morphism h : C — > C the following diagram commutes: 

C—^C 
~ h — 

c — >c. 



Also, deg(h) = deg(ft). 
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Since C — > C surjects, h is the unique morphism from C to C that 
makes the diagram commute. Such a diagram can fail to commute when C 
has genus 0, for example the map h : P 1 — > P 1 where h[x,y] = [px,y] 
surjects over Q but it reduces at p to the zero map. Theorem 8.5.7 has the 
following consequences (Exercise 8.5.9): 

Corollary 8.5.8. Let C have positive genus. Then 

(a) If h : C — > C surjects then so does h : C — > C . 

(b) If also k : C — > C" and C" has positive genus then 

k o h — k o h. 

(c) If h is an isomorphism then so is h. 

Part (c) of the corollary shows that if C has positive genus then the iso- 
morphism class of C over F p depends only on the isomorphism class of C 
over Q. 

With curves and maps reduced compatibly, we also need the following 
functorial result. 

Theorem 8.5.9. Let C be a nonsingular projective algebraic curve over Q 
with good reduction at p. The map on degree-0 divisors induced by reduction, 

Div°(C) ^Div°(<5), ^np(P)^^n P (P). 

sends principal divisors to principal divisors and therefore further induces a 
surjection of Picard groups, 

Pic°(C) — ► Pic°(C), £>p(P)] ^ [5>p(P)]. 

Let C also be a nonsingular projective algebraic curve over Q with good re- 
duction at p, and let C have positive genus. Let h : C — \C' be a morphism 
over Q, and let h* : Pic°(C) — > Pic°(C") and h* : Pic°(C) — > Pic°(C") be 
the induced forward maps of h and h. Then the following diagram commutes: 

Pic°(C) -^Pic°(C") 

Pic°(C) ^^Pic 0 ^). 

The first statement of Theorem 8.5.9 is not obvious because the induced 
map on divisors does not send the principal divisor div(/) to div(/). The 
diagram follows from the first statement. 

These results on reducing curves over Q at a prime p don't yet support 
the assertions in the previous section about elliptic curves over Q and their 
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reductions at a maximal ideal p. However, the ideas here extend to reducing 
curves over any number field K at a maximal ideal pK> using the local ring 
0 K ,(p) an d the valuation v Pk as in the proof of Lemma 8.4.1. The ideas thus 
extend to reducing curves over Q at a maximal ideal p. 

Theorem 8.5.10. Let 

if : E — > E' 

be an isogeny over Q of elliptic curves over Q. Then there is a reduction 

<p:E — > E' 

with the properties 

(a) (p is an isogeny. 

(b) If ip : E' — > E" is also an isogeny then ip o ip = ip o (p. 

(c) The following diagram commutes: 

E—^E' 
E >E'. 

(d) deg(<£) = deg(p). 

Part (a) of Theorem 8.5.10 follows from the general results of this section 
and from the fact that (p is a nonconstant map taking 0 to 0. The statement 
in Section 8.2 that the map [p] is an isogeny of degree p 2 on elliptic curves 
over Ep follows from Theorem 8.5.10 since any such elliptic curve is the reduc- 
tion E of an elliptic curve E over Q, and [p]g is the reduction of [p]e- The 
facts quoted after Proposition 8.4.4 about reducing isogenics also follow from 
Theorem 8.5.10. 

Exercises 

8.5.1. Given polynomials ipi, . . . , ip m , <p, ipi, ...,ipi in Z( p )[a;i, . . . , x n ], find an 
ideal / C Z( p )[xi, . . . , x n ] such that I = (ipi, . . . , <p m ,p<f> — 1) as an ideal 
of Q \x\ , . . . , x n ] , but / = (tpi , . . . , •ipi) in F p [x\ , . . . , x n ] . (A hint for this exer- 
cise is at the end of the book.) 

8.5.2. (a) In the context of Lemma 8.5.2, show that the elements (f+I) /(g+I) 
of the field of fractions of k [:/;]// where / and g are homogeneous of the same 
degree form a subficld K. 

(b) Assume that Iu-\ is a proper ideal of k[cc(j)], so that a;, ^ /. Define a 
map k[£ W ]// (i ) — > K by / i-> (/ hom + I)/(xf + I) where d = deg/. Show 
that this map is well defined and injective, and show that it extends to an 
isomorphism from the field of quotients of \s[xu-\\/ Iu\ to K. 

(c) Show that K has transcendence degree 1 and therefore so does the field 
of quotients of k[#(j)]/J(j). (A hint for this exercise is at the end of the book.) 
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8.5.3. Let C be an algebraic curve over a field k and let k[C] be its coordinate 
ring. Let P be a point of C, let m — rap be the maximal ideal of k[C] 
associated to P, and let M = Mp be the corresponding maximal ideal of the 
local ring k[C]p, cf. Section 7.2. Show that there is a bijection from the prime 
ideals of m and the prime ideals of M, given by / i— > IM and having inverse 

JHto. Since the local ring is a discrete valuation ring, its only primes are 
{0} and M. Consequently, every nonzero prime ideal of k[C] is maximal since 
it lies in some maximal ideal and by a result called Hubert's Nullstcllensatz 
any such ideal takes the form mp. 

8.5.4. Find an ideal /(o) of Z( p )[xi, x?\ defining a nonsingular curve with good 
reduction at p, but such that the homogenized ideal / of Z( p ) [xo, x\, X2] also 
dehomogenizes to an ideal 7( 2 ) defining a nonsingular curve with bad reduction 
at p. (A hint for this exercise is at the end of the book.) 

8.5.5. Let k be any field with char(k) ^ 2. Show that the map [t, u] i->- 
[t 2 — u 2 ,2tu, t 2 + u 2 ] is an isomorphism from P x (k) to the curve x 2 + y 2 = z 2 
in P 2 (k) with inverse [x, y, z] [x + z, y] by showing that both compositions 
give the identity. Why does this fail if char(k) = 2? 

8.5.6. Let R be a commutative ring with 1. This exercise shows that R has 
minimal prime ideals and that any such ideal is a subset of the zero-divisors 
ofR. 

(a) Let I 0 D h D • • • be a descending chain of prime ideals of R. (The 
subscripts do not denote dehomogenizations.) Show that H^Lo ^ n IS a S am a 
prime ideal. Thus R has minimal prime ideals. 

(b) Let P be a minimal prime ideal of R. The localization of R at P is the 
ring Rp = R(R — P)^ 1 whose definition should be clear from Exercise 6.5.5 
and the various examples of localizing that we have seen since then. This ring 
is Noetherian and PRp is its only prime ideal. Part (c) will show that PRp 
contains only zero-divisors of Rp. Show that consequently P contains only 
zero-divisors of R. 

(c) To complete the argument, let S be a Noetherian ring with only one 
prime ideal /. To show that I contains only zero-divisors of S, consider ideals 
of the form 

Anns(x) = {s € S : sx = 0}, 1 £ S, 1 / 0. 

Note that any such ideal contains only zero-divisors of S. Since S is Noether- 
ian any ascending chain of such ideals eventually stabilizes, and so the set of 
such ideals contains maximal elements. Let J = Anng(:r) be such a maximal 
clement. Show that J is prime, so that J — I and the result follows. (A hint 
for this exercise is at the end of the book.) 

8.5.7. (a) Show that any nonempty affine piece of a nonsingular projective 
algebraic curve is all but finitely many points of the curve. (Hints for this 
exercise are at the end of the book.) 

(b) Show that any nonempty affine piece of a nonsingular projective alge- 
braic curve is infinite. 
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8.5.8. In the proof of Lemma 8.5.6, show that aJ = J. 

8.5.9. Prove Corollary 8.5.8. 

8.6 Modular curves in characteristic p: Igusa's Theorem 

Let N be a positive integer and let p be a prime, p \ N. The modular curves 
X\(N) and X 0 (N) have good reduction at p. This section describes the re- 
ductions. 

First we reduce the moduli space at p. Let p be a maximal ideal of Q 
lying over p. Recall from Proposition 8.4.3 that an elliptic curve E over Q 
with good reduction at p has j(E) £ Z( p j, so this reduces at p to j(E) in F p . 
We want to avoid j = 0, 1728 in the image. Denote this with a prime in the 
notation, i.e., the suitable restriction of the moduli space Si(7V) over Q is 

S^AOgd = {[E,Q] e Si (AT) : E has good reduction at p, j[E) £ {0, 1728}}. 

In characteristic p, let Si (TV) denote the moduli space over Fp, i.e., it consists 
of equivalence classes [E, Q] where E is an elliptic curve over F p and Q € E is 
a point of order N and the equivalence relation is isomorphism over F p . Again 
avoid j = 0, 1728 by defining 

Si (AT)' - {[E, Q] € Si (AT) : j(E) £ {0, 1728}}. 

The resulting reduction map is 

Si(JV)U— ^Si(JV)', [Ej,Q] H- [Ej,Q]. 

This surjects. Indeed, any Weierstrass equation over F p lifts to Z, and the 
discriminant of a lift is a lift of the discriminant, so nonzero discriminants lift 
to nonzero discriminants, i.e., elliptic curves lift to elliptic curves. Proposi- 
tion 8.4.4(a) shows that each A"-torsion point has a lift as well. Thus reduc- 
tion surjects from Si(A^) g d to Si(A'), and since Si(A^)g d is the inverse image 
of Si (AT)' the displayed reduction map surjects as well. Priming the moduli 
space in characteristic p removes only finitely many points. Even though prim- 
ing in characteristic 0 removes infinitely many points, all that matters for our 
purposes is that the reduction surjects. 

To define the reduction Xi (N) of X\ (N) at p, consider the universal elliptic 
curve Ej defined over F p (j), 

This is the curve of (8.5), with discriminant j 2 j (j — 1728) 3 and invariant j. The 
points (x,y) of Ej are a subset of P 2 (F p (j)). Let Q be any point of Ej with 
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order N, i.e., [N]Q = but [n]Q ^ 0^ for 0 < n < N. Let ip x . N € F p (j)[x] 
be the minimal polynomial of its x-coordinate x(Q). Define a field 

K 1 {N)=F p (j)[x]/(<p 1 , N (x))- 

We assert without proof that Ki(JV) n F p = F p , so Ki(iV) is a function field 
over Ep. The polynomial <pi t N defines a planar curve X L (7V) planar , possibly 

2 

singular, whose points are ordered pairs (j, x) € F p . There is a birational 
equivalence over F p from the planar curve to any nonsingular projective al- 
gebraic curve whose function field is isomorphic to Ki(N), as discussed after 
Theorem 7.2.5. 

Theorem 8.6.1 (Igusa). Let N be a positive integer and let p be a prime 
with p \ N . The modular curve X\(N) has good reduction at p. There is an 
isomorphism of function fields 

FpiX^N)) -^>Ki(JV). 

Moreover, reducing the modular curve is compatible with reducing the moduli 
space in that the following diagram commutes: 

S 1 (N)' gd ^X 1 (N) 



SW X!(N). 

Here the top row is the map [Ej,Q] i— > (j,x(Q)) to the planar model followed 
by the birational equivalence to Xi(N), and similarly for the bottom row but 
in characteristic p. 

The primes in the diagram thus now mean to remove finitely many addi- 
tional points as necessary for all the maps to make sense. A similar commuta- 
tive diagram but with the planar models is easy to establish (Exercise 8.6.1), 
but this is not enough to show that the diagram in the theorem commutes 
as well. The diagram extends to divisor groups, restricts to degree 0 divisors, 
and takes principal divisors to principal divisors, giving a modified diagram 
that we will use in the next section, 

Div^S^d) >Pic°(X 1 (7V)) 

(8.28) 

Div°(Si(A0') >Pic°(Xi(A0)- 

In this diagram the maps down the left side and across the bottom surject. 
The map across the bottom surjects by Proposition 7.3.1 since tpi(Si(N)') 
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is all but finitely many points of Xi(N). Although Chapter 7 was set in 
characteristic 0, the proof of the proposition holds in any characteristic. 

Starting instead from a point Q as above and from the minimal polyno- 
mial ip 0 ,N of J2d=i x ([d\Q)> a similar discussion applies to reducing X 0 (N). 
As usual, we will discuss X\(N) but then use Xq(N) in connection with 
the Modularity Theorem. For more on Igusa's Theorem see [Igu59], [DR73], 
[KM85]. 

Exercise 

8.6.1. Show that a diagram as in Igusa's Theorem but with the planar models 
of Xi(N) and X\(N) commutes. (A hint for this exercise is at the end of the 
book.) 
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Let N be a positive integer and let p \ N be prime. This section gives a 
description of the Hecke operator T p at the level of Picard groups of reduced 
modular curves, 

T p : Pic°(Xi(JV)) — ► Pic o (Xi(J\0). 

By Theorem 8.5.9 the Hecke operator (d) on Xi(N) reduces modulo p and 
passes to Picard groups to give a commutative diagram 

Pic°(X 1 (JV))-^Pic°(X 1 (i\0) 

(8.29) 

Pic°(A 1 (A))^Pic°(A 1 (A)). 
We want a similar diagram for the Hecke operator T p on Pic°(Ai(A)), 

Pic°(A 1 (A))^^Pic°(A 1 (A)) 

(8.30) 

Pic°(A 1 (A))^Pic°(A 1 (A)). 

However, since the top row here is not the pushforward of a morphism from 
X\(N) to X\(N), Theorem 8.5.9 doesn't give this diagram as it gives (8.29). 
Recall from Section 6.3 and from Exercise 7.9.3 that in fact the top row 
is a pullback followed by a pushforward via an intermediate curve, either 
X®(N,p) or Xi 0 (N,p). These curves do not have good reduction at p, but 
each is defined by equations over Z whose reduction at p is isomorphic to 
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two copies of X\(N) crossing at the values of j where Ej is supersingular. 
This structure makes it possible to show that T p reduces and even to compute 
its reduction. Alternatively, results on Abelian varieties (e.g., Theorem 9.5.1 
of [BLR90]) also show that T p reduces. Both of these approaches are beyond 
the scope of this book, so instead we assert without proof that T p reduces, 
i.e., that a commutative diagram (8.30) exists, and then granting that there is 
a reduction of T p , we will compute it. The resulting description of T p is called 
the Eichlcr Shimura Relation. 

The way to compute the reduction T p on Pic 0 (X± (N)) is to compute it first 
in the moduli space environment. We will establish a moduli space diagram 
analogous to (8.30) and then use Igusa's Theorem to translate it to modular 
curves. Recall the moduli space interpretation of the Hecke operator T p on 
the divisor group of Si (AT), 

T p [E,Q}=J2[E/C,Q + C]. (8.31) 

c 

The sum is taken over all order p subgroups C C E such that C fl (Q) = {0}, 
in this case all order p subgroups since p \ N . Let p C Z be a maximal ideal 
lying over p. By Proposition 8.4.4(b), if the curve E on the left side of (8.31) 
has ordinary reduction at p then so do all the curves E/C on the right side. 
Restricting to such curves, this section begins by describing the right side 
of (8.31) over Fp rather than over Q, the description being in terms of the 
Frobenius map. The same description will hold for curves with supersingular 
reduction at p. 

Let E be an elliptic curve over Q with ordinary reduction at p and let 
Q G E be a, point of order N. Let Co be the kernel of the reduction map 
E\p] — > E[p], an order p subgroup of E since the map surjects and the 
reduction is ordinary. 



Lemma 8.7.1. For any order p subgroup C of E, 
[E/C, Q + C] ■■ 



[E°*,Q°*]_ ifC = C 0 , 
[E^\[ P ]Q<] ifC^Co. 



Proof. First suppose C = C 0 . Let E 1 = E/C and let Q' = Q + C = ip(Q) 
where <p : E — > E' is the quotient isogeny. Let tp ■ E' — > E be the dual 
isogeny of ip and consider the commutative diagram 

E'\p) — ^— >£[p] 



E'\p] >E[p\. 

Since ^E has ordinary reduction at p so does its isogenous image E 1 , and 
thus |-E'[p]| = p. The subgroup ip(E'\p]) of E[p] has order p since |ker(^)| = 
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deg(V0 = P, it is a subgroup of C = kcr(ip) since ip(i[)(E'\p])) — [p](E'[p]) = 
{0}, and so it is all of C since both have order p. Thus ip(E' \p]) = C = Cq is the 
kernel of the right vertical arrow, and since the left vertical arrow surjects, the 
diagram shows that the bottom arrow is the zero map, i.e., E'[p] C ker(tp). On 
the other hand, the relation ipoip — [p]j^, shows that ker(^) C ker([p]) = E'[p\. 
In sum, ker(-0) is £7'[p], a group of order p. 

By Theorem 8.5.10(d), reduction at p preserves the degrees of the three 
maps in the relation [p] — tp o p, 

de g(fe) = V 2 , deg(£) = p, deg(^) = p. 

Since ker([p]g^) = E'[p] = ker(^) has order p, since deg sep = | ker | for isoge- 
nics as discussed in Section 8.2, and since the total degree is the product of 
the separable and inseparable degrees, this implies 

degsop (\p]jf') = P> de §ins (\p]w)=P 

and 

deg sop (^) = p, deg ins (^) = 1. 

Also, ipotp— [p]-g, by Theorem 8.5.10(b), and so since separable and insepa- 
rable degrees are multiplicative, 

de gscp(^) = 1. deg ins (<^) = p. 

The general factorization <p> = <p scp o where deg(</3 sep ) = deg sep (<p) and 
p e = deg ins (<p) specializes in this case to 



where i : E ap — > E' is an isomorphism taking Q CTp to Q' . The isomorphism 
gives an equivalence of enhanced elliptic curves so that 

= [&>,&"], 

proving the first case of the lemma. 

Now suppose C 7^ C 0 . Again let E' = E/C, let Q' = Q + C = <p(Q) where 
ip is the quotient isogeny, and let ip be the dual isogeny of (p. Analogously 
to C = ker(ip) and Co, let C = ker(-0) and let C' 0 be the kernel of the reduction 
map E'[p] — > E'\p], an order p subgroup of E'[p\. This time consider the 
commutative diagram 

E[p]—^E'[p\ 
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and consider the subgroup <p(Co) of E'\p]. This is an order p subgroup since 
Co 7^ C = ker(<^). It is a subgroup of C — ker(-0) since i/}(tp(Co)) = \p](Co) = 
{0}, making (p(Co) all of C since both have order p. Also <p(Co) is a subgroup 
of C 0 since the diagram commutes and Co is the kernel of the left arrow, 
making <p(Co) all of Cq. Thus C = Cq. Now the argument for the first case 
applies to E' and Q' and ip (which takes Q' to [p]Q) in place of E and Q 
and showing that 4> — i ° a p where i : E' — > E is an isomorphism 
taking Q' P to [p]Q. Applying 1 to the coefficients of i gives an isomorphism 

i a p : E' — > E a p taking Q' to . That is, it gives an equivalence of 

enhanced elliptic curves so that 

[E>,Q>] = [E<,\p]Q°»\ 
proving the second case of the lemma. □ 

There are p + 1 order p subgroups C of E, one of which is Co- Define the 
moduli space diamond operator in characteristic p to be 

(7) :Si(JV) — >Si(JV), [E,Q] i-> [E, [d]Q], (d,N) = l. 

Then summing the formula of Lemma 8.7.1 over all order p subgroups C C E 
gives for a curve E with ordinary reduction at p, 

Y}EJC,Q + C] = K+p(7) ( T p - 1 )[S,0]. (8.32) 

c 

Lemma 8.7.1 extends to supcrsingular curves. If E is an elliptic curve 
over Q with supersingular reduction at p and Q £ E is a point of order N, 
then for any order p subgroup C of E (Exercise 8.7.1), 

[Ejc,Q + C] = [E?;Q"] = [E<,\p]Q<]. 

Summing this over the p + 1 such subgroups C of E, using the first expres- 
sion for [E/C, Q + C] once and the second expression p times, shows that 
formula (8.32) applies to curves with supersingular reduction at p as well. 
Therefore it applies to all curves with good reduction at p. 

If an elliptic curve E over F p has invariant j ^ {0, 1728} then the same 
holds for E a p and E a p \ Formulas (8.31) and (8.32) combine with this obser- 
vation to give a commutative diagram, where as before the primes mean to 
avoid some points, only finitely many in characteristic p, 

Si(A0' gd ^Div(Si(JV)g d ) 

Si (TV)' <TP+P{p)a " > Div(Si(jV) / ). 
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That is, T p on Si(iV)g d reduces at p to a p + p(p)a p 1 . This extends to divisor 
groups and then restricts to degree-0 divisors, 



-►Div 0 ^)^) 



Div^Sx^)' ) 



Div°(Si(JV)') ^Div°(Si(iV)'). 

We also have a commutative diagram (Exercise 8.7.2) 

<Tp+P<P> cr" 1 



Div°(Si(JV)') 



Pic°(Xi(A0) 



-^Div 0 (Si(Af)') 



CTp,» + <P>,cr* 



>PiC°(X!(JV)). 



(8.33) 



(8.34) 



A cube-shaped diagram now exists as follows, where all squares except 
possibly the back one commute. 



Pic 0 {X 1 (AT)) 



Div o (S!(J\0y 



Div°(Si(iV)') 



Pic°(Xi(JV)) 



To establish this, note that 



-> Div^S^iV)^) 



->Div°(Si(JV)') 



-+Pic°(Xi(i\0) 



■>Pic°(Xi(JV)) 



• The top square, a diagram in characteristic 0 relating T p on the mod- 
uli space and on the modular curve, is a restriction of commutative dia- 
gram (7.18). 

• The side squares, relating reduction at p to the map from the moduli 
space to the modular curve, are commutative diagram (8.28) from Igusa's 
Theorem. 

• The front square, relating T p on the moduli space to reduction at p, is 
commutative diagram (8.33) resulting from Lemma 8.7.1. 

• The bottom square, relating maps in characteristic p on the moduli space 
and on the modular curve, is commutative diagram (8.34). 
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Let a temporarily denote a p ^ + {p) t a* on Pic (Xi(N)). Consider the map 
from the top front left corner of the cube down, then into the page, and then 
across, 

Div°(Si(A% d ) >Div°(Si(A0') ^"(X^N)) -^Pic° {X^N)) . 

(8.35) 

The composite of the first two stages surjects as explained after diagram (8.28). 
Since the left square of the cube commutes, (8.35) is 

Div 0 (Si(JV)^) >Pic°(X 1 (N)) >Pic°(X 1 (N)) -^Pic°(Xi(JV)), 

(8.36) 

where again the composite of the first two stages surjects. But since the bot- 
tom, front, right, and top squares of the cube commute, (8.35) also becomes 

Div 0 (S!(iV); d ) >Pic°(X 1 (N)) -^->Pfc o (Xi(i\0) >Pic°(X 1 (N)). 

Diagram (8.30) shows that a map f p : Pic°(Xi(7V)) — > Pic°(Xi(iV)) makes 
the back square of the cube commute. This changes the previous composite 
to 

Div 0 (S!(iV); d ) >Pic 0 (X 1 (N)) >Pic°(X 1 (N)) -^->Pic°(Xi(JV)). 

(8.37) 

Since (8.36) and (8.37) are equal, and their shared composite of the first two 
stages surjects, their respective maps er = <t P; * + (p)^a* and T p at the last 

stage are equal. That is, <r p ^ + (p)^<J p is the map that makes the back square 
of the cube commute. This gives 

Theorem 8.7.2 (Eichler-Shimura Relation). Let p \ N. The following 
diagram commutes: 

Pic°(XiW) >Pic o (*i(A0) 

Pic 0 (X, (TV)) >Pic°(X 1 (N)). 

In particular since (p) acts trivially on X 0 (N), the following diagram com- 
mutes as well: 

Pic°(X o (A0) >Pic°(X 0 (N)) 

(8.38) 

Pic 0 (X 0 (N)) a "' +ap ) Pic 0 (X 0 (N)). 
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The Hecke operator T p in characteristic p is now described in terms of the 
Frobenius map <j p , as desired. 

Exercises 

8.7.1. Let E be an elliptic curve over Q with supersingular reduction at p 
and let Q € E be a point of order N . Extend Lemma 8.7.1 to show that for 
any order p subgroup C of E, 

[Ejc,Q^C] = [E<*,Q*] = [E<,\p]Q<]. 
(A hint for this exercise is at the end of the book.) 

8.7.2. This exercise shows that diagram (8.34), the bottom square of the cube 
in the section, commutes. 

(a) Show that the following diagrams commute, where the map down the 
sides is described in Theorem 8.6.1: 

Div°(Si(JV)') ►Div 0 (Si(J\T) / ) 



Div°(Xi(iV)P lanar ) — ^ Div°(Xi(iV)P lanar ), 
Div°(Si(AT)') — ^Div°(Si(iV)') 



Div°(Xi(Ar)P lanar ) — ^ Div°(X!(iV)P lanar ). 

(Hints for this exercise are at the end of the book.) 

(b) Use Theorem 8.6.1 to show that the diagrams as in part (a) but with 
Pic°(Xi(7V)) in place of Div 0 (Xi(iV)P lanar ) on the bottom row commute. 

(c) Recall as a special case of Exercise 1.5.3 and Section 5.2 that the moduli 
space interpretation of the diamond operator (d) is 

(d) :Si(JV) — >Si(JV), [E,Q]^[E,[d]Q]. 

The section defined the moduli space diamond operator in characteristic p by 
the same formula, 

id) : Si(JV) — »• Si(JV), [E,Q] ^ [E, [d]Q]. 

This immediately gives a commutative diagram relating (d) on the moduli 
space to reduction, and the diagram extends to degree 0 divisors, 
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(d) 



Div u (S!(iV)' )^Div u (S 1 (iV)' ) 



Div°(Si(JV)') — ^->Div°(Si(JV)'). 



Show that the diamond operator in characteristic p on the moduli space and 
on the modular curve (cf. diagram (8.29)) are compatible in that the following 
diagram commutes: 



(d) 



Div°(Si(JV)') >Div°(Si(7V)') 



(d), 



(8.39) 



Pic°(Xi(JV)) ^Pic°(X!(iV)). 



(d) Use part (b) and diagram (8.39) to complete the proof that dia- 
gram (8.34) commutes. 
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Recall that if E is an elliptic curve over Q then its analytic conductor was 
defined in Section 7.7 as the smallest N such that X 0 (N) surjects to E, and 
its algebraic conductor Ne was described in Section 8.3. Both conductors 
depend only on the isogeny class over Q of E. In this section we simply call 
the algebraic conductor the conductor en route to explaining why the two 
conductors are in fact equal and no longer need to be distinguished. 

Theorem 8.8.1 (Modularity Theorem, Version a p ). Let E be an elliptic 
curve over Q with conductor Ne- Then for some newform f € ^(/^(-/Vs)), 

a,p(f) = a p (E) for all primes p. 

This version of Modularity is most obviously related to Version Aq, the 
version that provides a map A'j — ► E, since each version involves a new- 
form /, and unsurprisingly the two /'s are the same. But since A'j is a variety 
rather than a curve, and our policy is to argue using only curves, we give 
instead a partial proof that Version Xq, providing a map Xq(N) — ► E, im- 
plies this version. The argument necessarily requires a little effort to extract / 
from Xq(N) in consequence of avoiding varieties. Specifically, we prove 

Theorem 8.8.2. Let E be an elliptic curve over Q with conductor Ne, let N 
be a positive integer, and let 



a : X 0 (N) — > E 
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be a nonzero morphism over Q of curves over Q. Then for some newform 
f e S 2 (r 0 (M f )) where M f \N, 

a p (f) = a p {E) for all primes p\ NeN. 

After proving this we will touch on the rest of the argument that Ver- 
sion Xq of Modularity implies Version a p , and on the more natural-seeming 
argument starting from Version Aq. 

Proof. The route from a p (f) for some / to a p (E) is that 

• ci p (f) on A'j is T p for each /, by a variant of diagram (6.15), and a sum 
of factors A' f over all / is isogenous to Pic°(X 0 (N)), by a variant of The- 
orem 6.6.6, 

• T p on Pic°(X 0 (N)) reduces to + a* on Pic°(X 0 (iV)), b Y the Eichler- 
Shimura Relation (Xo(N) has good reduction at p because p\ N), 

• o>,*+CTp on Pic°(X 0 (N)) commutes with a* to become a p ^+a* on Pic°(£'), 
by formulas (8.17) and (8.19) (E has good reduction at p because p\ Ne), 

• and finally a p ^ + cr* on Pic (E) is a p (E), by Proposition 8.3.2. 

Recall some ideas from Section 6.6, given there for the group Ti(N) and 
now modified for T 0 (N). The complex vector space S2(T 0 (N)) has basis 

&2(N) ={J\J\J r(nr) 

f n a 

where the first union is taken over equivalence class representatives of new- 
forms / G S2(T 0 (Mf)) with Mf dividing N, the second over divisors n 
of N/Mf, and the third over embeddings a : C. Work over C now, 

identifying complex algebraic curves and Ricmann surfaces, and identifying 
Picard groups and Jacobians. Then the Picard group associated to T 0 (N) is 
isogenous to a direct sum of Abelian varieties, both sides being viewed as 
complex tori, 

Pic°(X o (AOc)^04,o 

f,n 

and there exists a dual isogeny 

04 iC ^Pic°(X o (AOc). 

f,n 

The given map a : X 0 (N) — > E extends to 

a c : X 0 (N)c — > E c , 

viewed as a morphism of complex algebraic curves or as a holomorphic map 
of compact Riemann surfaces, a surjection in either case. 
For any p \ NeN the diagram 
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04,c 



n/,„K(/)-a p (£)) 



>©4,c 



Pic°(X 0 (^) ( 



f,n 



-a„(E) ' a c » 

-^^Pic°(X 0 (7V) c ) —^Pic°(E c ) 



(8.40) 



has the following properties, to be proved in a moment: 

(a) If a p (f) ^ a p (E) then the top row surjects from @ n A'j c to ©„^4/ c- 

(b) The square commutes. 

(c) The composite map on the bottom row is zero. 

If a p (f) ^ Op(E) for some / and some p \ NeN then in the diagram for 
that p, mapping @ n A'j C across the top row takes it to all of ® n A' f c 
by property (a), and then mapping this down gives its isogenous image 
in Pic 0 (X 0 (./V)c). By property (b) the isogenous image also comes from map- 
ping 0 n A'j c down the left side of the diagram and then halfway across the 
bottom row. Property (c) now shows that the isogenous image of (J) n A'^ c 
lies in ker(ac,*)- Therefore, if for each / there exists a, p \ NeN such that 
a p (f) 7^ dp(E) then all of Pic°(X 0 (N) c ) lies in ker(ac,*)- But a c ,* surjects, 
so this is impossible. That is, there is a newform / such that a p (f) = a p (E) 
for all p \ NeN, as we needed to prove. 

Returning to diagram (8.40), to prove its property (a) let a p (f) ^ a p (E). 
Recall that a p (f) is an algebraic integer, and a p (E) is a rational integer. Thus 
their difference 5 satisfies a minimal monic polynomial with rational integer 
coefficients, 



S e + a 1 S e 1 + ha e _!(5 + a e = 

The resulting relation S(S e ~ 1 + aiS e ~ 2 



a 1 ,...,a e e Z, a e ^ 0. 
+ a e-i) = —o-e shows that S is 



surjective on A'j c as desired, since — a e is. 

Property (b) follows quickly from diagram (6.20) suitably modified from 

A (TV) tor 0 (JV). 

To prove property (c) switch back to working over Q and consider the 
following diagram: 



Pic°(X 0 (7V)) 



-Op(-E) 



>Pic u (X o (A0) 



^Pic°(.E) 



Pic°(X 0 (N)) a -^^Pic°(MN)) >Pic°(£) (8-41) 



a, „ ~ tT P ,,+o-*-a p (_E) 
■fi V- f ATW 5. -D\„" zr\ i. T3,-„0 



Pic u (X 0 (7V)) 



->Pic u (£) 



>Pic u (£). 



The top row here is a restriction of the bottom row of diagram (8.40), and it 
suffices to prove that this restriction doesn't surject (Exercise 8.8.1). The top 
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left square commutes since it is essentially (8.38) from the Eichler-Shimura 
Relation, and the top right square commutes by Corollary 8.5.9. The bottom 
rectangle commutes because formulas (8.17) and (8.19) from earlier in the 
chapter show that cr Pj * and a* commute with a*, and multiplication by a p (E) 
certainly commutes with the linear map a* as well. The second map on the 
bottom row is zero by Proposition 8.3.2, making the bottom row zero and thus 
making the middle row zero. So the top row followed by the right vertical arrow 
to the second row is zero. But the vertical arrow surjects by Theorem 8.5.9. 
This shows that the top row can't surject, as desired. With properties (a), (b), 
and (c) of diagram (8.40) established, the proof of Theorem 8.8.2 is complete. 

□ 

For Version Xq of Modularity to imply Version a p , the ncwform / of 
Theorem 8.8.2 needs to have level Mf = Ne and to have Fourier coefficients 
a p {f) = a p (E) for all p. 

Strong Multiplicity One shows that all the Fourier coefficients of / are 
rational integers. To see this, recall that if a is any automorphism of Q then 
the conjugate of / under a is defined as 

oo 

r(r) = ]>>„(/) v i . 

n=l 

According to Theorem 6.5.4 all conjugates of / are again newforms. In this 
case they are all equal to / itself because for all p\ N and all a, 

a P (f) = a P (E) => a p (f) eZ^ a p {JY = a p (f), 

so that f a = f by Strong Multiplicity One. This means that the Fourier 
coefficients of / are algebraic integers invariant under Aut(Q), i.e., the Fourier 
coefficients are rational integers as claimed. Thus / has number field K/ = Q. 

Obtaining Version a p of Modularity from here requires results beyond the 
scope of this book, but we sketch the ideas. As quoted in Section 7.7, the 
Abelian variety A'f, viewed as a variety over Q, has dimension [K/ : Q] = 1, 
i.e., A'f is an elliptic curve over Q. The map X 0 (N) — > Pic°(X 0 (N)) — ► A' 
is defined over Q, so we can run the proof of Theorem 8.8.2 again to show 
that there exists a newform g such that a p (g) — a p (A'f) for all but finitely 
many p. The proof in this case shows that g = f, so 

a,p(f) = a p {A'f) for all but finitely many p. (8.42) 

Carayol [Car86], building on the work of Eichler-Shimura, Langlands, and 
Dcligne, showed that in fact a p (f) — a p (A'f) for all p, and the level of / is the 
conductor of A'j, notated Mf — Nf, and this is also the analytic conductor 
of Af. But also, the work so far gives an isogeny over Q from A'f to E, so that 
a p (A'f) = a,p(E) for all p and the conductor of A'^ is the conductor of E, that 
is, Nf — Ne- This gives a p (f) — a p (E) for all p and Mf — Ne, i.e., Version a p 
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of Modularity. It also shows that the analytic and algebraic conductors of E 
are the same. That is, the newform / associated to E has the same level as 
the smallest modular curve Xq(N) that maps to E. 

Starting instead from Version Aq of Modularity that is, starting from a 
map a : A'j, — ► E, a more natural-looking proof that a p (f) — a p (E) for 
all p \ NeMj (where Ne is the conductor of E and Mf is the level of /) 
would set up a diagram analogous to (8.41), 



a P (f)-a p (E) 
> A t 



o p ,,+o* p -a p {E) ~ 

A f > A f 



^E 



_ o- p .»+CT*-a p (_E) 

-> E > E 



If a p (f) a p (E) then the top row is surjective, as is E — > E, but the bottom 
row is zero and hence the middle row is zero, giving a contradiction because all 
the rectangles commute. However, this argument makes use of the structure 
of A'j as a variety over Q, reduces the variety modulo p, and then makes 
further use of the structure of the reduction as a variety over F p , invoking 
algebraic geometry well beyond the scope of this book. By contrast our proof 
of Theorem 8.8.2 makes no reference to the variety structure of Pic°(X 0 (N)), 
and it uses Pic°(X 0 (N)), the Picard group of the reduced curve, rather than 
reducing an Abelian variety. 

Section 1.3 mentioned that some complex tori have complex multiplication, 
endomorphisms other than {[N] : N e Z}. This notion extends to algebraic el- 
liptic curves, providing more examples of Modularity. For example the elliptic 
curve 

E :y 2 = x 3 -d, d e Z, d ^ 0 

has the order 3 automorphism (x,y) >-> (fisx,y) over Q(ns), and its ring of 
endomorphisms is isomorphic to A = Z[/i3]. As in Exercise 8.3.6(b), a p (E) = 0 
for all p = 2 (mod 3) for which the displayed Weierstrass equation is minimal. 
The theory of complex multiplication in fact describes a p (E) for all p: there 
exist an integer N | 12 Hold?' anc ^ a character \ '■ (A/NA)* — > C* of order 6 



such that x( u ) 
(Z/JVZ)*, and 



for u G A*, \{a) = (a/3) (Lcgcndre symbol) for a e 

a p( E ) = \ x ^ n - 

\n\ 2 =p 

(See Chapter 2 of [Sil94] for proofs of these results, and see Exercise II. 5. 7 
of [Kob93] for the case d = —16, giving N = 3.) The solution-counts of E are 
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the Fourier coefficients of the function 

neA 

This function is a modular form of level 37V 2 by the weight 2 version of Hcckc's 
construction in Section 4.11. It is a normalized eigenform by an argument sim- 
ilar to the one in Section 5.9 for the weight 1 theta function. For the minimal 
choice of N, it is in fact a ncwform of level 37V 2 = Ne, thus illustrating Ver- 
sion a p of Modularity for these elliptic curves E. The elliptic curves E over Q 
with complex multiplication form a small class of examples — for example they 
have only a finite set of j-invariants — but they are important in number the- 
ory and Shimura's proof of Version Xq of Modularity for such curves [Shi71] 
provided evidence for the algebraic formulations of Modularity in general. 

Version a p of the Modularity Theorem rephrases in terms of L-functions. 
Recall from Chapter 5 that if / 6 <S2(/o(iV)) is a ncwform then its L-function 
is 

oo 

L(s, f) = J2 a n(f)n- s = J](l - %U)P~ S + IaKjV- 25 )- 1 , (8.43) 

n—1 p 

with convergence in a right half plane. For any elliptic curve E over Q let 1b 
be the trivial character modulo the conductor N of E as in Section 8.3. The 
Hasse-Weil L-function of E, encoding all the solution-counts a p (E), is 

L(s, E) = - a p (E)p- s + l E {p)p 1 - 2s )-\ 
v 

From Section 8.3 we have the definitions 

a p {E) = l, a p e(E) =p e + l- \E(F P *)\, e>l 
and the recurrence 

a p e(E) = a p (E)a r -i(E) - l E (p)pa p e-i(E) 1 e > 2. 

One more definition, 

a mn (E) = a m (E)a n (E), (m,n) = l, 

completes the analogy between the values a n (E) and the Fourier coefficients 
a n (f) of a ncwform. As in Chapter 5, the L-function of an elliptic curve is 
now 

oo 

L(s, E) = J2 a n (E)n- s = - a p (E)p- s + l^p 1 - 2 ')- 1 . (8.44) 

n— 1 p 

Half plane convergence of L(s, E) can be established as well by estimat- 
ing a p (E). But this is unnecessary since comparing the products in (8.43) 
and (8.44) shows that Version a p of Modularity is equivalent to 
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Theorem 8.8.3 (Modularity Theorem, Version L). Let E be an elliptic 
curve over Q with conductor N E . Then for some newform f £ 5 2 (/o(-/V B )), 

L(s,f)=L(s,E). 

Faltings's Isogcny Theorem (Corollary 5.2 in Chapter 2 of [CS86]), a deep 
result, now shows that this Version L of Modularity implies 

Theorem 8.8.4 (Modularity Theorem, strong Version ^4q). Let E be 

an elliptic curve over Q with conductor Ne- Then for some newform f £ 
S2(Tq(Ne)) the Abelian variety A'f is also an elliptic curve over Q and there 
exists an isogeny over Q 

4 E - 

To see this, suppose that by Version L we have L(s, f) = L(s, E). Then / 
has rational coefficients, making A'j an elliptic curve. Equation (8.42) shows 
that L(s, A'j) and L(s,f) have the same Euler product factors for all but 
finitely many primes p. So the same is true of L(s, A'j) and L(s, E), and now 
Faltings's Theorem gives an isogeny A'j — > E. 

Version L of the Modularity Theorem shows that the half plane conver- 
gence, analytic continuation, and functional equation of L(s, f) from Theo- 
rem 5.10.2 now apply to L(s,E). This is important because the continued 
L(s, E) is conjectured to contain sophisticated information about the group 
structure of E. Specifically, since E(Q) is a finitely generated Abelian group 
it takes the form 

e(q) = re z r , 

where T is the torsion subgroup and r is the rank. The rank is much harder 
to compute than the torsion. However, 

Conjecture 8.8.5 (Weak Birch and Swinnerton-Dyer Conjecture). 

Let E be an elliptic curve defined over Q. Then the order of vanishing of 
L(s,E) at s = 1 is the rank of E(Q). That is, if E(Q) has rank r then 

L(s,E) = (s-iy 9 (s), 0(l)^O,oo. 

The original half plane of convergence of L(s,E) is {Re(s) > 2}, and the 
functional equation then determines L(s, E) for Rc(s) < 0, but the behavior 
of L(s, E) at the center of the remaining strip {0 < Re(s) < 2} is what 
conjecturally determines the rank of E(Q). The Birch and Swinnerton-Dyer 
Conjecture would give an algorithm for finding all rational points on elliptic 
curves, and it would give an effective method for finding imaginary quadratic 
fields with a given class number. For more on the Birch and Swinnerton-Dyer 
Conjecture see [Tat02], Appendix C.16 of [Sil86], or Chapter 17 of [Hus04]. 



8.8 Fourier coefficients, //-functions, and Modularity 363 



Exercise 

8.8.1. Let /3c denote the map of the bottom row of diagram (8.40), 

n a,o(T p -a p (E)) n 

fa : Pic°(A 0 (A) c ) > Pic°(£ c ). 

Augment /3c to get a map of complex algebraic curves, 

7c : X 0 (N) >Pic°(A 0 (A)c) -^Pic°(£ c ) ^c- 

Here the first stage is P i-> \{P) — {Pq)\ where Pq € X 0 (N)c is a base point, and 
the third stage is >->• J2Qi- Since 7c is a composite of holomorphic 

maps, it is holomorphic as a map of Riemann surfaces and therefore it is a 
morphism as a map of complex algebraic curves. 

(a) Show that if 7c is zero then /3c is zero. (A hint for this exercise is at 
the end of the book.) 

(b) Assume that the base point Po in part (a) has algebraic coordinates. 
Show that 7c is defined over Q as follows. It suffices to show that 7 C = 7c 
for all a e Aut(C/Q). Compute that for any P e A 0 (A)c and any a, 

ic{p°) = i c {py = ic(p a ). 

Since P a can be any point of A 0 (A)c, this gives the result. 

(c) Let (3 denote the map of the top row of diagram (8.41), 

n a,o(T p -a p (E)) n 

0 : Pic 0 (Ac, (A)) > Pic°(£). 

This is the restriction of /3c to Q-points. Consider the corresponding restric- 
tion of 7c , viewed as a morphism over Q of algebraic curves over Q according 
to part (b), 

7 : A 0 (A) >Pic°(A 0 (A)) ^^Pic°(^) >E. 

Use the maps of curves 7 and 7c to show that if /3 does not surject then /3c 
is zero. This was used in proving property (c) of diagram (8.40). The result is 
immediate from quoting that as morphisms from varieties to curves, /3 and /3c 
are both constant or surjective, but the argument in this exercise uses only 
the algebraic geometry of curves. 
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Galois Representations 



This book has explained the idea that all elliptic curves over Q arise from mod- 
ular forms. Chapters 1 and 2 introduced elliptic curves and modular curves as 
Ricmann surfaces, and Chapter 1 described elliptic curves as algebraic curves 
over C. As a general principle, information about mathematical objects can 
be obtained from related algebraic structures. Elliptic curves already form 
Abclian groups. Modular curves do not, but Chapter 3 showed that the com- 
plex vector space of weight 2 cusp forms associated to a modular curve has 
dimension equal to the genus of the curve, Chapter 5 defined the Hccke op- 
erators, linear operators that act on the vector space, and Chapter 6 showed 
that integral homology is a lattice in the dual space and is stable under the 
Hecke action. 

As number theorists we are interested in equations over number fields, 
in particular elliptic curves over Q. Chapter 7 showed that modular curves 
are defined over Q as well. As another general principle, information about 
equations can be obtained by reducing them modulo primes p. Chapter 8 
reduced the equations of elliptic curves and modular curves to obtain similar 
relations for the two kinds of curve: for an elliptic curve E over Q, 

a p (E) = Op.* + a* as an endomorphism of Pic°(_E), 
while for the modular curve X 0 (N) the Eichler-Shimura Relation is 

T p = cr p ,* + c* as an endomorphism of Pic (X Q (N)). 

These relations hold for all but finitely many p, and each involves different 
geometric objects as p varies. 

Using more algebraic structure, this last chapter lifts the two relations 
from characteristic p to characteristic 0. For any prime I the €-power torsion 
groups of an elliptic curve give rise to a vector space Vi(E) over the l-adic 
number field Q^. Similarly, the ^-power torsion groups of the Picard group of 
a modular curve give an £-adic vector space Vg(X). The vector spaces Vg(E) 
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and Ve(X) are acted on by the absolute Galois group of Q, the group Gq of 
automorphisms of the algebraic closure Q. This group subsumes the Galois 
groups of all number fields, and it contains absolute Frobenius elements Frob p 
for maximal ideals p of Z lying over rational primes p. The vector spaces Ve(X) 
are also acted on by the Hecke algebra. The two relations in the previous 
paragraph lead to the relations 

Frobp — a p (E)Froh p + p = 0 as an endomorphism of Ve(E) 

and 

Frobp - T p Frobp + p = 0 as an endomorphism of Ve(X 0 (N)). 

These hold for a dense set of elements Frobp in Gq, but now each involves a 
single vector space as Frobp varies. The second relation connects the Hecke 
action and the Galois action on the vector spaces associated to modular curves. 

The vector spaces Vi are Galois representations of the group Gq. The 
Galois representation associated to a modular curve decomposes into pieces 
associated to modular forms. The Modularity Theorem in this context is that 
the Galois representation associated to any elliptic curve over Q arises from 
such a piece. This fits into a larger problem, to show that Galois representa- 
tions from algebraic geometry arise from modular forms. 

This chapter provides less background than the rest of the book. It quotes 
results from algebraic number theory and it uses techniques from algebra, es- 
pecially tensors, without comment. Related reading: Chapter 15 of [Hus04], 
Chapter IIL7 of [Sil86], Section 7.4 of [Shi73]. The volumes [Mur95] and 
[CSS97] contain lectures on the proof of Fermat's Last Theorem, and [DDT94] 
is a survey article. Henri Darmon's article in [Mur95] discusses the conjecture 
of Serre mentioned at the end of the chapter. 

9.1 Galois number fields 

Recall from Section 6.4 that a number field is a field FcQ such that the 
degree [F : Q] is finite, and each number field has its ring of algebraic inte- 
gers Of- This chapter will work with number fields F such that the extension 
F/Q is Galois. These fields are notated F rather than K to emphasize that 
they play a different role from other number fields earlier in the book, but the 
reader is cautioned that F ? continues to denote a finite field. The purpose of 
this section is to illustrate some results from algebraic number theory in the 
Galois case by giving specific examples, without proof, to convey a concrete 
sense of the ideas before we start using them. The reader without background 
in algebraic number theory is strongly encouraged to refer to a text on the 
subject. 
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Let F be a Galois number field and let p € Z be prime. There are positive 
integers e, /, and g that describe the ideal pOp as a product of maximal ideals 
of Of, 



pO F = (pi • • • p g ) e , 0 F /p l £* Fp/ for i = 1, 



e/ 5 =[F:Q]. 



The first formula is (8.22) specialized to the Galois case. The ramification 
degree e says how many times each maximal ideal p of Of that lies over p 
repeats as a factor of pOp- There are only finitely many p such that e > 1, 
the primes that ramify in F. The residue degree f is the dimension of the 
residue field f p = Of/P (a finite field) as a vector space over F p = Z/pZ for 
any p over p. The decomposition index g is the number of distinct p over p. 
The condition efg = [F : Q] says that the net measure of ramification degree, 
residue degree, and decomposition index associated to each rational prime p 
is the field extension degree. Equivalently, efg = |Gal(F/Q)|. 
The simplest Galois number fields are the quadratic fields. Let 

F = Q(Vd), d e Z squarefree. 

Then [F : Q] = 2 and the extension F/Q is Galois with its group generated 
by the automorphism taking Vd to — Vd. The discriminant of the field is 



A-p = 



Ad if 1 (mod 4), 
d if d = 1 (mod 4), 



and the ring of algebraic integers in this field is 



O f = Z 



A* 



/Zf" 



:a,beZ 



if d 



This says that O f = Z[Vd) if d ^ 1 (mod 4) while O f = Z 
1 (mod 4) , but phrasing results in terms of Ap and thus making no more direct 
reference to cases is tidier. The Legendre symbol from elementary number 
theory, (a/p) where a € Z and p is an odd prime, extends to incorporate the 
Kronecker symbol, defined only for a = 0, 1 (mod 4), 



1 if a = 1 (mod 8), 
-1 if a = 5 (mod 8), 
0 if a = 0 (mod 4). 



This makes the behavior of a rational prime p in Of easy to notate, 



pO F 



pq if(A F / P )= 1, 
p if (A F /p) = -1, 



P 2 if(4 P /p) 



0. 
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The content of this formula is that the odd primes p such that d is a quadratic 
residue modulo p decompose in O-p, the odd primes such that d is a nonresiduc 
remain inert, and the odd primes dividing d ramify, while 2 decomposes if 
d = 1 (mod 8), remains inert if d = 5 (mod 8), and ramifies if d ^ 1 (mod 4). 

Another family of simple Galois number fields is the cyclotomic fields. Let 
iV be a positive integer and let 

F = Q( Miv ), fi N = e 2 ^ N . 

Then [F : Q] = (f)(N) (Euler totient) and the extension F/Q is Galois with 
group isomorphic to (Z/NZ)*, 

Gal(F/Q) (Z/NZ)*, (n N ^ ^n) > a (mod N). (9.1) 

The cyclotomic integers are 

Op = Z[^ N ] = {a 0 + ai//jv H 1- a^-i^^ 1 : a 0 , . . . , ajv_i e Z}. 

Each rational prime not dividing AT is unramified in F, 

pO F = Pi ---Pg, P\N, 

and its residue degree / is the order of p (mod N) in (Z/NZ)*. The primes 
dividing TV ramify in Q(/xjv)- We do not need a precise description of their 
behavior since we will focus on the unramified primes. 

For the simplest non-Abelian Galois group, let d > 1 be a cubefree integer, 
let d 1 / 3 denote the real cube root of d, and let 

F = Q(d 1 /3, M3 ). 

In this case [F : Q] = 6 and Gal(F/Q) is isomorphic to S3, the symmetric 
group on three letters. The Galois group is generated by 

/rfV3 M3d l/3\ ^ / d l/3 ^ d l/3\ 

CT ' V ^3 ^ / ' T ' V ^3 >-> f4 J ' 

and the isomorphism (noncanonical) is 



Gal(F/Q) S 3 , (j 1 v (12 3), t i ^ (2 3). 

The rational primes not dividing 3c? are unramified in F, and their behavior 
is (Exercise 9.1.1) 





fpl • " "P6 


if p = 


1 (mod 3) and d is a cube modulo p, 


pO f = 1 


| P1P2 


if p = 


1 (mod 3) and d is not a cube modulo p, (9.2) 




[P1P2P3 


if p = 


2 (mod 3). 



Returning to the general situation, again let F be a Galois number field 
and let p be a rational prime. For each maximal ideal p of Of lying over p 
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the decomposition group of p is the subgroup of the Galois group that fixes p 
as a set, 

Dp = {a € Gal(F/Q) : p a = p}. 

The decomposition group has order ef, so its index in Gal(F/Q) is indeed the 
decomposition index g. By its definition it acts on the residue field f p = Of/P, 

(x + p) a = x a + p, x e Of, o e D p . 

The inertia group of p is the kernel of the action, 

Ip = {a € Dp : x a = x (mod p) for all x <E Of}- 

The inertia group has order e, so it is trivial for all p lying over any unram- 
ified p. Recall the Frobcnius automorphism a p : x i-> x p in characteristic p 
from Chapter 8. If we view F p = Z/pZ as a subfield of f p = Of/P — F p / then 
there is an injection 

Dp/Ip — ► Gal(f p /F p ) = (o-p). 

Since both groups have order /, the injection is an isomorphism and the 
quotient D p /Ip has a generator that maps to a p . Any representative of this 
generator in D p is called a Frobenius elementoi Gal(F/Q) and denoted Frob p . 
That is, Frobp is any element of a particular coset alp in the subgroup Dp 
of Gal(F/Q). Its action on F, restricted to Of, descends to the residue 
field f p = Of/P, where it is the action of a p . When p is unramified, mak- 
ing the inertia group I p trivial, Frob p is unique. To summarize, 

Definition 9.1.1. Let F/Q be a Galois extension. Let p be a rational prime 
and let p be a maximal ideal of Of lying over p. A Frobenius element 
o/Gal(F/Q) is any element Frobp satisfying the condition 

Thus Frobp acts on the residue field f p as the Frobenius automorphism a p . 

When F/Q is Galois the Galois group acts transitively on the maximal 
ideals lying over p. i.e., given any two such ideals p and p' there is an auto- 
morphism a G Gal(F/Q) such that 

p a = p'. 

(We made reference to this fact, and to the earlier-mentioned fact that Dp 
surjects to Gal(f p /F p ), in the proof of Theorem 8.5.4.) The associated decom- 
position and inertia groups satisfy 

Dp<? = <T~ 1 DpCr, Ipa = <j~ 1 IpO~, 

and the relation between corresponding Frobenius elements is 
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Frobp^ = a L Frob p a. 

If p is ramified then this means that the conjugate of a Frobenius is a Frobcnius 
of the conjugate. The relation shows that if the Galois group is Abclian then 
Frobp for any p lying over p can be denoted Frob p . 

To compute the Frobenius element in the quadratic field case, again let 
F = Q(Vd) where d € Z is squarefree. The Galois group Gal(F/Q) consists 
of the identity and the map taking V ' A F to — \fA~p. Let p be a maximal ideal 
of O f lying over an odd prime p. Each a + b(A F + ^/3p)/2 e O f reduces to 
the residue field f p , with a, b, and A F reducing to its subfield F p and with 2 
reducing to F^. Using the same symbols for the reductions, compute in the 
residue field that 



(p-i)/2 




This shows that the Frobenius element is the Legendre symbol in that Frob p 
multiplies y/A~F~ by Z\^~ 1)/2 = (A F /p) for odd primes p. There are infinitely 
many such p such that (A F /p) = 1, and similarly for (A F /p) = —1. Therefore 
every element of the Galois group of F takes the form Frob p for infinitely 
many p, and there is an isomorphism 

Gal(F/Q) {±1}, Frobp ^ (A F /p) for odd p\ A F . 

The Frobenius automorphism has a natural description in the cyclotomic 
case F = Q(/ijv) as well. For any prime p \ N let p lie over p and note that 
in the residue field f p = F p [^jv], again using the same symbols to denote 
reductions, 

/ y 

| \ ^ m | \ pm 

\ m / m 

This shows that Frob p is the element of Gal(Q(/iAr)/Q) that takes /xjv to n P N , 
and thus the isomorphism (9.1) takes Frob p to p (mod N). By Dirichlet's 
Theorem on Arithmetic Progressions, for each a e (Z/NZ)* there are in- 
finitely many p such that p = a (mod N). Therefore every clement of the 
Galois group of F again takes the form Frob p for infinitely many p, and the 
isomorphism (9.1) is 

Gal(F/Q) (Z/NZ)*, Frob p ^ p (mod N) for p\ N. (9.3) 

In the non-Abelian example F = Q(c? 1 / 3 , /x 3 ) since the conjugacy classes in 
any symmetric group S n are specified by the cycle structure of their elements, 
in this case of 53 they are 

{1}, {(12),(23),(31)}, {(123),(132)}. 

So the conjugacy class of an element of S3 is determined by the element's order, 
and therefore this holds in Gal(F/Q) as well. To determine the conjugacy class 
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of Frobp it thus suffices to determine its order, the residue degree / of the 
prime p lying under p. Formula (9.2) and the formula efg = 6 combine to 
show that for any unramified rational prime p, i.e., p \ 3d, the associated 
conjugacy class 

{Frobp : p lies over p} (9.4) 

is 



the elements of order < 



1 if p = 1 (mod 3) and d is a cube modulo p, 
3 if p = 1 (mod 3) and d is not a cube modulo p, 

2 ifp=2(mod3). 

Each conjugacy class takes the form (9.4) for infinitely many p, as with the 
previous two fields. This fact, as well as Dirichlet's Theorem, is a special case 
of 

Theorem 9.1.2 (Tchebotarov Density Theorem, weak version). Let 

F be a Galois number field. Then every element o/Gal(F/Q) takes the form 
Frobp for infinitely many maximal ideals p of Of- 

We end this section with a motivating example. There is an embedding 
of S3 in GL 2 (Z) (Exercise 9.1.2) such that 



(123) 1 y 



(2 3) 1 y 



0 1 

1 0 



(9.5) 



Again letting F = Q(c? 1 / 3 ,^ 3 ) this gives a representation 

p : Gal(F/Q) — > GL 2 (Z). 

The trace of p is a well defined function on conjugacy classes (9.4) and there- 
fore depends only on the underlying unramified rational primes p, 



tr p (Frobp) 



2 if p — 1 (mod 3) and d is a cube modulo p, 
-1 if p = 1 (mod 3) and d is not a cube modulo p, (9.6) 
0 ifp=2(mod3). 



Recall from Section 4.11 the theta function 0 x (t) <G Mi(3N 2 ,tp) where TV = 
3 ripidP an d i> is the quadratic character with conductor 3. Formula (9.6) 
for tr p(Frobp) matches formula (4.51) for the Fourier coefficient a p (9 x ) when 
p \ 3d. Similarly the determinant of p is defined on conjugacy classes over 
unramified primes, 



det p(Frobp) 



if p 
if p 



1 (mod 3), 

2 (mod 3). 



This is tp(p)- So the Galois group representation p, as described by its trace 
and determinant on Frobcnius elements, arises from the modular form 6 X . This 
modular form is a normalized eigenform by Section 5.9 and a cusp form by 
Exercise 5.11.3. The idea of this chapter is that 2-dimensional representations 
of Galois groups arise from such modular forms in great generality. 
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Exercises 

9.1.1. Let F = Q(d 1 /3 j/i3 ) be the non-Abclian number field from the section. 
Granting that the factorization of p\ 3d in Q(d 1 / 3 ) comes from reducing x 3 — d 
modulo p, obtain the factorization of p in F. (A hint for this exercise is at the 
end of the book.) 

9.1.2. Show that (9.5) defines an embedding of S3 in GL 2 (Z). 

9.2 The l-adic integers and the £-adic numbers 

For the remainder of this chapter let £ denote a prime number. 
Consider an affine algebraic curve over Q, 

C:xy=l. 

The points of C are identified with the Abelian group Q* via (x, y) i-> x, so C 
has an Abelian group structure as well. For each n £ Z + the points of order £ n 
form a subgroup C\l n \. Since this is identified with the cyclic group of £™th 
roots of unity there is an isomorphism 

C[£ n ] — > Z/fZ, fa 1 V a. (9.7) 

Consequently there is also an isomorphism 

Aut(C[r]) — > (Z/fZ)*, (jip ^ m. 

This extends to the isomorphism (9.1) with N = £ n , 

Gal(Q(^n)/Q) — > (Z/fZ)*, (ntn i-> /iji) i-> m. (9.8) 

Amalgamate the number fields Q(/i^n) for all n by defining 

00 

Q(m<«) = (J Q(w)- 

n=l 

This is again a field, a subfield of Q but not a number field since it has infinite 
degree over Q, placing it outside classical Galois theory. Still, since Q(/x £0 o) 
is a union of Galois number fields whose Galois groups act on finite groups, 
its automorphism group should be a limit of Galois groups and act on a limit 
of finite groups. To make this idea precise, let 

G Qi£ = Aut(Q(/x £0 c)). 

Every a <G Gqj restricts to a n e Gal(Q(/^n)/Q) for each n £ Z + . The 
restrictions form a compatible sequence, 
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(o-i, cr 2 , o 3 , . . . ), <t„ + i|q( w „) = ct„ for all n. 

Conversely, every such compatible sequence of automorphisms determines an 
automorphism a £ Gq £. Thus Gq i can be viewed as a group of compatible 
sequences, where the group operation is componentwise composition. Each 
sequence acts componentwise on the group of compatible sequences of £-power 
roots of unity, the i-adic Tate module of C , 

Ta,(G) = ^1, $1, ■■■)■■ = AV" for a11 

where the group operation is componentwise multiplication. Under the iso- 
morphisms (9.7) and (9.8) at each component, Ta^(G) is isomorphic to the 
Abelian group of sequences 

{(ai, d2, 03, . . . ) : o n € Z/£ n Z and a„ + i = a„ (mod i" 1 ) for all n}, 

where the group operation is componentwise addition, and Gq^ is isomorphic 
to the Abelian group of sequences 

{(mi, m 2 , m 3 , . . . ) : m n € (Z/£ n Z)* and m n+ i = m„ (mod £") for all n}, 

where the group operation is componentwise multiplication. The action of Gq^ 
on Ta^(G) transfers to an action of this group on the previous one by compo- 
nentwise multiplication as well. These constructions motivate 

Definition 9.2.1. Let £ be prime. An i-adic integer is a sequence 

a = (ai, a 2 , a 3 , . . . ) 

with a n <E Z/£ n Z and a n+ \ = a n (mod £ n Z) for each n e Z + . The ring of 
£-adic integers, where the operations are componentwise addition and multi- 
plication, is denoted Zg. 

Thus each entry a n in an £-adic integer determines the preceding en- 
tries a n -i down to a\, while the entry a n+1 to its right is one of its £ lifts 
from Z/£ n Z to Z/£ n+1 Z. This makes the £-adic integers a special case of an 
algebraic construct called the inverse limit, in this case the inverse limit of 
the rings Z/fZ for n e Z+, 

Z e = lim{Z/rZ}. 

n 

The ring Zg is an integral domain. The natural map 

Z — > Z e , an (a + £Z, a + £ 2 Z, a + £ 3 Z, ...) 

is a ring injection, so we view Z as a subring of Z(. The map induces a natural 
isomorphism 

Zj£Z ^fZ t J£Ze, a + £Z^a + £Z e , 
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so we identify ZjlZ and ZgjlZg. As the inverse limit of a system of finite 
rings, Zg is pro finite. 

The multiplicative group of units in Zg is 

ZJ = {(ai, 02, 03, • • • ) G Z £ : a„ e (Z/fZ)* for all n}, 

for given such a compatible sequence, the sequence of inverses modulo £ n is 
again compatible. Every £-adic integer a with a\ ^ 0 in Zj£Z is invertible. 
The ideal £Zg is the unique maximal ideal of Zg, and Z* e = Zg—£Zg. The ideal 
structure of Z e is Zg D £Zg D £ 2 Zg D ■■■ . 
To put a topology on Zg let 

U x (n) = x + fZ £ , x eZg, n e Z+. 

If .t e Z| then also U x (n) = x(l + £ n Z* e ), or 

U x (n) = x • kcr (Z* — ->■ (Z/fZ)*), a; G Z*, n e Z+. 

Then a basis of the topology of Z^ is {t/ x (n) : x e Zg, n e Z + }, meaning that 
the topology is the smallest topology containing all U x (n). The neighborhoods 
Ui(n) will figure frequently in the discussion, so they are abbreviated to U(n). 
The ring operations of Zg are continuous (Exercise 9.2.1(a)). As the inverse 
limit of finite rings, the topological ring Zg is compact (Exercise 9.2.1(b)). 
For any positive integer d the Z^-module Zf has the product topology, a basis 
being {U v (n) : v £ Z^,n € Z + } where now U v {n) = v + £ n Zf. We remark 
that the topology of Zg arises from extending the ^-adic valuation of Chapter 8 
from Z to Zg and then defining a corresponding metric, but in this chapter 
only the topology matters. 

The field of i-adic numbers is the field of quotients of Zg. A basis of the 
topology of Qg is {U x (n) : x £ Q^, n G Z + } where as before U x {n) = x+£ n Zg. 
The field operations of are continuous (Exercise 9.2.1(c)). For any positive 
integer d the Q^-vector space has the product topology, a basis being 
{U v (n) : v € Qg, n £ Z + }. The matrix group GLd(Qg) acquires a topology as 
a subset of Qf . If m e GL d (Z e ) then U m (n) = m(I + £ n M d (Zg)), or 

U m (n) — m • kcr (GL d (Z £ ) — > GL d (Z/rZ)), m G GL d (Z,), 

and as before, C/(n) means Uj(n). Since matrix multiplication and matrix 
inversion are defined by rational functions of the matrix entries, they are 
continuous under this topology. Vector-by-matrix multiplication, 

Q d e x GL d (Qg) — > Q £ d , 

is continuous as well. Here we view vectors as rows so that matrices multi- 
ply from the right; this is not the usual convention but in this chapter such 
multiplication will represent the action of Galois groups, and these have been 
acting from the right since their appearance in Chapter 7. 
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Let V be a finite-dimensional vector space over Q^. Any basis B = 
. . . , (3d) of V determines a topology T on V to make the coordinate 
map cb ■ V — ► given by J2 a jflj ^ [ a i:---:°d] a homeomorphism. If 
B' is another basis determining another topology T' then the identity map 
(V, T) — > (V, T') is Cg] o too cb where to : — > Qf is multiplication by 
a transition matrix between the bases. This is a homeomorphism, so T' = T ■ 
That is, V acquires a topology as a finite-dimensional vector space over 
independently of coordinates. 

Let K be any number field, not necessarily Galois over Q, and let Ok 
be its ring of integers. The factorization (8.22) of £Ok into maximal ideals is 
written 

^ K =n ACA ' 

x\e 

where the notation A | I means that A lies over the rational prime I. Similarly 
to Zg and Q^, for each A the ring of X-adic integers is the inverse limit 

O k ,a = lim{0 K /A n } 

n 

and then the field of X-adic numbers is the field of quotients Ka of Ok, a- 
We may view Ze as a subring of Ok, a and as a subfield of Ka (Exer- 
cise 9.2.3(a)). Define the residue degree f\ to be [k^ : F^] where k^ = Ok/A. 
Then the containments Ze C Ok, a and C Ka arc equalities when e\f\ = 1 
(Exercise 9.2.3(b)), and in fact \K\ : Qe] = e\f\. There is a ring isomorphism 

x\t 

We provide the proof to illustrate properties of tensors and inverse limits for 
the less experienced reader. Compute that 

O k <8> Z e = lim{0 K <8> Z/fZ} = lim{0 K /^"0 K }. 

n n 

But Ok/^Ok = O k / 11a A neA = I1a 0 K /X n ^. Thus 

O k ® Z t = lhn{[[0 K /X n ^} = JJlim{0 K /A ne n = ]JO K X , 

n A A n A 

and this gives 

K ®d Qe = O k <8> = O k <8> Z e Qe = J[{Ok,x ®z t Qe) = JJ K A . 

A A 

Each K\ acquires a topology as a finite-dimensional vector space over Q^. 
Let V be a finite-dimensional vector space over some K\. Then V is also 
a finite-dimensional vector space over Q^. Thus V acquires two topologies, 
from K A and from Q^, but since Ka acquires its topology in turn from 
the two topologies are the same. Any finitely generated Z^-submodule of V is 
compact since Ze is compact. The results in these last two paragraphs will be 
used in Sections 9.3 and 9.5. 
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Exercises 

9.2.1. (a) Show that the ring operations of are continuous under the £-adic 
topology. 

(b) Show that the £-adic topology makes a topological subgroup of 
the product P — Yl^Li Z/^™Z. The product is compact by the Tychonov 
Theorem (see for example [MunOO]). For each n consider the finite product 
P n = rim=i Z/f m Z and the projection 7r„ : P — > P n . The product topology 
makes the projections continuous. Show that each P n contains a closed sub- 
group C n of compatible elements such that Z^ = fX^ =1 (C n ) . This shows 
that Z£ is a closed subgroup of P, making it compact. (A hint for this exercise 
is at the end of the book.) 

(c) Show that the field operations of are continuous under the f-adic 
topology. 

9.2.2. Since U(n) is a subgroup of Z| for all n e Z + , every neighborhood U 
of 1 in Q| contains a nontrivial subgroup of Q^. Show that the analogous 
result fails with C in place of Q^, and similarly for GLd(Q^) versus GLd(C). 
(A hint for this exercise is at the end of the book.) 

9.2.3. (a) Show that X nex n Z = £ n Z for all n € Z+, and that therefore we 
may view Z e as a subring of Ok, a and as a subficld of K A . (Hints for this 
exercise are at the end of the book.) 

(b) Show that the containments are equalities when e\f\ = 1. 

(c) What does (9.9) say for the three Galois number fields of Section 9.1? 

9.3 Galois representations 

As always, let Q denote the algebraic closure of Q. The automorphisms of Q 
form the absolute Galois group of Q, 

Gq = Aut(Q). 

Recall that I is prime. This section defines ^-adic Galois representations as 
homomorphisms from Gq into £-adic matrix groups, or alternatively as £-adic 
vector spaces that are GQ-modules. 

The algebraic closure Q is the union of all Galois number fields. Any au- 
tomorphism a G Gq fixes Q pointwise and restricts to an automorphism 
ct|f € Gal(F/Q) for every Galois number field F. Restriction from Gq 
to Gal(F/Q) surjects. The restrictions are compatible in that 

a-p = op' |f if FcF'. 

Conversely, every compatible system of automorphisms {of} over all Galois 
number fields F defines an automorphism of Q, i.e., an element of Gq. As in 
the previous section this describes Gq as an inverse limit, 
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G Q = Hm{Gal(F/Q)}. 

F 

Since the Galois groups are finite, Gq is a profinitc group. A good theory arises 
from giving it a suitable topology, the Krull topology, and then considering 
only its continuous actions. For each automorphism a e Gq and each Galois 
number field F there is an associated coset of a normal subgroup that gives 
rise to a finite quotient, U a (F) = {err : t|f = 1}, or 

U„(F) = a • kcr (Gq — > Gal(F/Q)). 

Again U(F) denotes U\(F) where now 1 = lq-. A basis of the Krull topology 
on Gq is 

{U a (F) : a e Gq, F is a Galois number field}. 

Thus every U(F) is an open normal subgroup of Gq, and conversely every 
open normal subgroup of Gq takes the form U(F) for a Galois number field F 
(Exercise 9.3.1). As the inverse limit of finite groups, the topological group Gq 
is compact. 

One clement of Gq is complex conjugation. For a family of elements, let 
p € Z be any prime and let p C Z be any maximal ideal over p. Use p as the 
kernel in defining the reduction map Z — > F p . The decomposition group of p 
is 

D p = {aeG Q :p° = p}. 

Thus each a e D p acts on Z/p as (x + p) a = x a + p, and this can be viewed 
as an action on F p . Let Gf p denote the absolute Galois group Aut(F p ) of F p . 
The reduction map 

D P — ► G Fp 

is surjective. An absolute Frobenius element over pis any preimage Frobp € D p 
of the Frobenius automorphism a p e Gf p • Thus Frob p is defined only up to 
the kernel of the reduction map, the inertia group of p, 

Ip = {cr e D p : x a = x (mod p) for all x G Z}. 

For each Galois number field F the restriction map Gq — > Gal(F/Q) takes 
an absolute Frobenius element to a corresponding Frobenius element for F, 

Frobp | F = Frobp F where p F = P H F. (9.10) 

All maximal ideals of Z over p are conjugate to p, and the definition of Frob p 
shows that analogously to before, 

Frobp^ = (T _1 Frob p cr, a € Gq. 

Again this means that the conjugate of a Frobenius is a Frobenius of the 
conjugate. Theorem 9.1.2 and the definition of the Krull topology combine to 
show that we now have many examples (Exercise 9.3.2): 
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Theorem 9.3.1. For each maximal ideal p of Z lying over any but a finite 
set of rational primes p, choose an absolute Frobenius element Frob p . The set 
of such elements forms a dense subset of Gq. 

Let x '■ (Z/NZ)* — > C* be a primitive Dirichlet character. Consider the 
following diagram, in which ttn is restriction to Gal(Q(^jv) /Q), the horizontal 
arrow is the isomorphism (9.1), and p x ^ is defined to make the triangle 
commute. 

Gq 

7TJV 

Gal(Q(/ijv)/Q) > (Z/NZ)* (9.11) 




C* 



The diagram shows that x determines a homomorphism 
Px — Px,n ° ■ Gq — > C* . 

In particular, since complex conjugation in Gq restricts to pn >-> in the 
finite Galois group it follows that p x (conj) = 1). Similarly, since (9.10) and 
then (9.3) show that an absolute Frobenius element Frobp in Gq lying over a 
prime p \ N maps to p (mod N) in (Z/NZ)* it follows that p x (Frob p ) = \{p) 
for such p. To show that the homomorphism p x is continuous it suffices to 
check that / o^ 1 (l) is open (see Exercise 9.3.3), and this holds because the 
inverse image is U(F) for some Galois number field F C Q(mjv)- 

Conversely, any continuous homomorphism p : Gq — > C* has fi- 
nite image (Exercise 9.3.4) and thus factors through some Abclian Galois 
group Gal(F/Q). The Kronecker-Weber Theorem states that we may take 
F = Q(/xjv) for some N, and now diagram (9.11) shows that p = p x for some 
Dirichlet character \ p . Taking N as small as possible (see Exercise 9.3.5) 
makes Xp primitive. That is, Dirichlet characters lead to homomorphisms of a 
certain type, and all such homomorphisms arise from Dirichlet characters. Sec- 
tion 9.6 will discuss a conjectured 2-dimcnsional analog of this 1-dimcnsional 
correspondence, encompassing the Modularity Theorem. 

Exercise 9.3.4 shows more generally that any continuous homomorphism 
P '■ Gq — > GLd(C) has finite image and therefore captures only a finite 
amount of the structure of Gq. However, the image of a Dirichlet character x 
lies in a number field K and therefore in a field K.\ where A lies over any given 
rational prime I. So C* can be replaced by in diagram (9.11), and the 
argument that the associated representation p x : Gq — > K* x is continuous 
still applies. This suggests the following definition. 

Definition 9.3.2. Let d be a positive integer. A d- dimensional i-adic Ga- 
lois representation is a continuous homomorphism 
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p : G Q — > GL d (L) 

where L is a finite extension field ofQe- If p' : Gq — >• GLd(L) is another such 
representation and there is a matrix m e GLrf(L) swc/i i/iat p'(cr) = m~ 1 p(a)m 
for all a e Gq i/ien p and p' are equivalent. Equivalence is notated p <~ p' . 

We state without proof that every finite extension field L of takes the 
form K\ for some number field K and maximal ideal A | I of Ok, and that 
for such L the ring Ol = Ok. a is well defined independently of K and A. 
The ring Ol is a lattice in L, i.e., there is Z^-basis of Ol that is also a 
Q^-basis of L. The £-adic Galois representations in this chapter will often use 
L = Qe, but for example Section 9.5 will construct representations using fields 
L = K^a where / is a modular form and is the number field generated by 
its Fourier coefficients. Complex Galois representations are defined similarly 
to Definition 9.3.2, but because their images are finite they will not play a 
role here until the end of the chapter. Galois representations are assumed from 
now on to be €-adic rather than complex unless otherwise specified. 

For an example involving infinite structure, first note that the field con- 
tainment Q(/i£oo) C Q gives the group surjection Gq — > Gq^ restricting 
each automorphism of Q to Q(^°°)- Follow this by the group isomorphism 
Gq^ — > Z| described before Definition 9.2.1 to get the i-adic cyclotomic 
character of Gq , 

Xi '■ Gq — ► Q* e , u i-> (mi, 77j 2 , m 3, • ■ • ) where u% n = p^fr for all n. 

To show that the cyclotomic character is continuous it suffices to check that 
Xj 1 (V (n)) is open for any n (again see Exercise 9.3.3), and this holds because 
the inverse image is U(Q(p,g™)). Thus xe is a 1-dimensional Galois represen- 
tation. Similarly to the Dirichlct character example, x^(conj) = —1, and we 
will frequently use the formula (Exercise 9.3.6) 

X £(Frob p )=p iorp^L (9.12) 

This is independent of how p and Frob p are chosen. The formula shows that 
despite being 1-dimensional xe has infinite image, already indicating that l- 
adic Galois representations have a richer theory than complex ones. Profinitc 
groups GL d (L) provide better views of the profinite group Gq. 

Given a Galois representation p we naturally want to know values p{u) 
for a G Gq. In particular we want to evaluate p at absolute Frobenius ele- 
ments. But each Frob p is defined only up to the absolute inertia group I p , 
so the notion of p(Frob p ) is well defined if and only if I p C kcr p. Further- 
more, if p and p' lie over the same rational prime p then the inertia groups I p 
and Ip> are conjugate in Gq, so the condition I p C ker p depends only on the 
underlying prime p since ker p is normal in Gq . The condition makes sense 
at the level of the similarity class of p, again since kcr p is normal in Gq . 
Although p(Frob p ) does depend on the choice of p over p when it is defined, 
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its characteristic polynomial depends only on the conjugacy class of p(Frob p ) 
and therefore only on p. 

Definition 9.3.3. Let p be a Galois representation and let p be prime. Then 
p is unramified at p if I p C kcr p for any maximal ideal p C Z lying over p. 

For example, let \ '■ (Z/7VZ)* — > C* be a primitive Dirichlct character, 
let p be any prime not dividing TV, and let p lie over p. Then in diagram (9.11) 
the map tt n takes I p into I Pn where p N = pnQ(^Tv)- But L Pn = {1} since p is 
unramified in Q(/ijv), and so I p C kerp x , i.e., p x is unramified at p. Similarly 
for the cyclotomic character xe, let P ^ £ and let p lie over p. Any a € I p 
acts trivially on F„ = Q(/i^ ) for all positive n since p is unramified in F n . 
This shows that I p C hsrxi, i.e., xi 1S unramified at p. In general, the Galois 
representations that we will consider are unramified at all but finitely many 
primes p. For such a representation the values p(Frob p ) for p over unramified p 
determine p everywhere by continuity. 

To think about xi m another way, recall that it arises from the GQ-modulc 
structure of the Tate module of G. Let Vg(C) = Ta^(G) <g> Q, a 1-dimensional 
vector space over Q^. Then the following diagram commutes, where the map 
from Vi(C) to is the Q^-lincar extension of the isomorphism Ta£(C) — > 
described before Definition 9.2.1, and the map from Gq to Q| is Xi- 

V e (C) x G Q ► V e (C) 



Qi x Q* > Q e 

Once Vg(C) is topologized compatibly with the action of Gq on Ve(C) 
is continuous by the continuity of all the other maps in the diagram. This 
example motivates a second definition of an ^-adic Galois representation as a 
vector space over with GQ-module structure. 

Definition 9.3.4. Let d be a positive integer. A d- dimensional l-adic Ga- 
lois representation is a d-dimensional topological vector space V over L, 
where L is a finite extension field of Qg, that is also a GQ-module such that 
the action 

V x Gq — > V, 0, a) i— > v a 

is continuous. Lf V is another such representation and there is a continuous 
GQ-module isomorphism of h-vector spaces V V then V and V are 
equivalent. 

The two definitions of an ^-adic Galois representation are compatible (Ex- 
ercise 9.3.7). Similarly to how the 1-dimensional representation Vi(C) arises 
from the GQ-modulc structure of Ta^(G), the next two sections will consider 
higher-dimensional Galois representations arising from the Tate modules of 
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elliptic curves, Jacobians, and Abelian varieties associated to modular forms. 
Any 1-dimcnsional Galois representation has an Abelian image but the higher- 
dimensional representations capture non-Abelian structure of Gq as well. 

The values taken by a Galois representation p x : Gq — > J£\ arising from a 
Dirichlet character are roots of unity and therefore they lie in the subring Ok, a 
of Ka- The values taken by the cyclotomic character \i lie in the subring Z^ 
of Qg. The second definition of Galois representation helps to show that this 
is a general phenomenon. 

Proposition 9.3.5. Let p : Gq — > GL^(L) be a Galois representation. Then 
p is similar to a Galois representation p' : Gq — > GL^Ol). 

Proof. Let V = L d and let A — O^. Then A is a lattice of V, hence a finitely 
generated Z^-module, hence compact as noted at the end of Section 9.2. Since 
Gq is compact as well, so is the image A' of A x Gq under the map V x 
Gq — > V. Thus the image lies in \~ r A for some r € Z + . The image is 
finitely generated, it contains A so its rank is at least d, it is free since Ol is a 
principal ideal domain, and so its rank is exactly d. It is taken to itself by Gq . 
All of this combines to show that any 0L-basis of A' gives the desired p' . □ 

This result will be used in Section 9.6. 
Exercises 

9.3.1. Show that every open normal subgroup of Gq takes the form J7(F) for 
a Galois number field F. (A hint for this exercise is at the end of the book.) 

9.3.2. Show that Theorem 9.3.1 follows from Theorem 9.1.2. (A hint for this 
exercise is at the end of the book.) 

9.3.3. Let p : G — > H be & homomorphism of topological groups. Show 
that p is continuous if and only if /9 _1 (V) is open for each V in a basis of 
neighborhoods of the identity 1#. 

9.3.4. Show that any continuous homomorphism p : Gq — > GLd(C) factors 
through Gal(F/Q) — ► GL d (C) for some Galois number field F. Hence its 
image is finite. (A hint for this exercise is at the end of the book.) 

9.3.5. Suppose that p : Gq — ► C* factors through Gal(Q(/ijv)/Q) and also 
factors through Gal(Q(/ijv')/Q)- Show that p factors through Gal(Q(/i n )/Q) 
where n = gcd(7V, TV'). (A hint for this exercise is at the end of the book.) 

9.3.6. Establish formula (9.12). 

9.3.7. (a) Let p : Gq — > GL d (L) be a Galois representation as in Defi- 
nition 9.3.2. Show that this makes L d a Galois representation as in Defini- 
tion 9.3.4. On the other hand, let V be a d-dimcnsional ^-adic Galois repre- 
sentation as in Definition 9.3.4. Show that any choice of ordered basis of V 



382 9 Galois Representations 



gives a representation p : Gq — > GLd(L) as in Definition 9.3.2. Thus the 
two definitions of Galois representation are compatible. Remember to check 
continuity. (Hints for this exercise are at the end of the book.) 

(b) Show that the notions of equivalence in the two definitions are also 
compatible. 

9.4 Galois representations and elliptic curves 

This section associates 2-dimensional Galois representations to elliptic curves. 

Let E be an elliptic curve over Q and let £ be prime. Multiplication by £ 
on ^-powcr torsion of E gives maps 

E[£]i E[£ 2 }< E[l 3 ]< . . . 

The £-adic Tate module of E is the resulting inverse limit, 

Ta e (E) =lim{E[£ n }}. 

n 

Choose a basis (P n , Q n ) of E[t n ] for each n E Z + with the choices compatible 
in that each basis is a lift of its predecessor, 

[£}P n+1 =P n and [e]Q n+1 = Q n , n e Z+. 

Each basis determines an isomorphism E[£ n ] —} (Z/£ n Z) 2 , and so 

T* t {E) = Z\. 

For each n the field Q(^[£™]) is a Galois number field, giving a restriction 
map 

G Q — ► Gal(Q(S[r])/Q), ^a| Q{E[P]) , 
and there is also an injection 

Gal(Q(£[r])/Q) — > Aut(E[£ n }). 

These maps are compatible in that for each n the following diagram commutes. 

G Q 




Aut(£^"]) <— - - Aut(E[£ n+1 }) 

That is, the Tate module is indeed a GQ-module. Each basis (P n ,Q n ) de- 
termines an isomorphism Aut(S[£™]) GL 2 (Z/^"Z), and these combine to 
give Aut(Ta^(S)) GL 2 (Z£). Since Gq acts on Ta,g(E) the net result is a 
homomorphism 
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Pej ■ G Q — > GL 2 (Z £ ) c GL 2 (Q £ ). 

The homomorphism is continuous by an argument similar to the one for xe 
in the previous section (Exercise 9.4.1). Thus pe.i is a Galois representation, 
the 2-dimensional Galois representation associated to E. 

Theorem 9.4.1. Let I be 'prime and let E be an elliptic curve over Q with 
conductor N . The Galois representation pE.t is unramified at every prime 
p \ £N. For any such p let p C Z be any maximal ideal over p. Then the 
characteristic equation of pe, e(Frob p ) is 

x 2 - a p (E)x + p = 0. 

The Galois representation pE,e « s irreducible. 

Alternatively, the vector space Vg(E) = Ta^(D) <£> Q is the 2-dimensional 
Galois representation as in Definition 9.3.4 associated to E, and a diagram as 
follows commutes. 

V t (E) x G Q > V e (E) 

Q £ 2 xGL 2 (Q,) >Q| 

The theorem can be rephrased appropriately, e.g., its displayed equation is 
the characteristic equation of Frob p itself as an automorphism of Ve(E). 

Proof. Let p \ £N and let p lie over p. Recall the absolute Galois group of F p , 
Gf p = Aut(Fp). There is a commutative diagram 

Dp ►Aut 

Gf p >Au.t(E[£ n ]) 

where the top map restricts the action of Gq on E to D p and the bottom 
map is given by the action of Gf p on E. The inertia group I v is contained 
in the kernel of the map down the left side and then across the bottom. 
The map down the right side is an isomorphism since the condition p \ IN 
means that E has good reduction at p and the reduction preserves ^"-torsion 
structure, cf. Section 8.3, Theorem 8.1.2, and Theorem 8.4.4. Consequently, 
I p is contained in the kernel of the map across the top. Since n is arbitrary 
this means that I v C hdr pej as desired. 

To establish the characteristic polynomial of Frobp for p \ IN we need 
to compute det pE,e (Frobp) and tr pE,e(Frob p ). For the determinant, let p n : 
Gq — > GL 2 (Z/£"Z) be the nth entry of pE,e f° r 71 <= Z + . As in Lemma 7.6.1, 
the Weil pairing shows that the action of a e Gq on the root of unity u £n is 
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given by the determinant, but by definition the action is also to raise /j,gn to 
the nth entry of the cyclotomic character Xf(cr), 

,,cr _ dctp„(er) _ Xt,n(<r) 

— — AV 

That is, detp„(<r) = xe,n(&) in (Z/£ n Z)* for all n, so det PE,e(cr) — Xii 17 ) 
in Zf . In particular (9.12) gives 

detp B ^(Frob p ) = p. 

For the trace, let A = pE,t(Froh p ) and recall that a square matrix satisfies its 
characteristic polynomial. Since det A = p the characteristic equation is A 2 — 
(tr A)A + p = 0, and this rewrites as tr A = A + pA^ 1 . Proposition 8.3.2 says 
that <7 p .* + a* = dp(E), this being an equality of endomorphisms of Pic°(-E). 
Recall from equations (8.14) and (8.15) that <7 P .» acts as a v and a* acts as 

po-p 1 - The diagram in this proof shows that a v acts on E[£ n ] as Frob Pn acts 
on E[£ n \. That is, A + pA- 1 = a p (E) (mod t n ) for all n. Since this holds for 
all n, it follows that tr A = a p (E), or 

trp£,£(Frobp) = a p (im- 
proving the last statement of the theorem is beyond the scope of this 
book. □ 

Exercises 

9.4.1. Show that pE,e is continuous. 

9.4.2. Recall from Section 8.3 that we have defined 

a r {E)=p e + l-\E(Y p e)\, e>l, ai {E) = 1, 

shown that 

a p e(E) = a p e tt + cr*e, e > 1, 
and stated that there is a recurrence 

a pC (E) — a p {E)a pC -i(E) — l E (p)pa p a-2(E) for all e > 2, 

where 1^ is the trivial character modulo the conductor N of E. This exercise 
proves the recurrence except when p = 2 and 2 \ N. 

(a) Show that the end of the proof of Theorem 9.4.1 extends to show that 

tr(A e ) = a r {E), e > 1. 

(b) Let Ai and A2 be the eigenvalues of A. Show that 

tr {A e ) = \'i + X e 2 = (Ar + A 2 )(Ar 1 + A^" 1 ) - A^A^ 2 + X^ 2 ). 
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Use this and the identities detA = p, tr A = a p (E) to prove the recurrence 
for p \ £N, so that letting I vary proves the recurrence for p \ N. 

(c) For p | N, assume p is odd. By (8.10) we may take E : y 2 = x 3 + ax 2 
with a e F p . 

The reduction E is multiplicative if a ^ 0. Show that in this case if a 
is a square in F^ (where q = p e ) then the formula t i-> (t 2 — a,t(t 2 — a)) 

describes a map from P 1 (F 9 ) to E(F q ) that surjects and that injects except 
for hitting (0, 0) twice; but if a is a nonsquare in F* then the map injects but 
misses (0,0). Use this to compute a p e(E) for all e, depending on whether a is 
a square or not in F^ , and to prove the recurrence in each case. 

The reduction is additive if a = 0. Show that in this case the formula 
t i-> (t 2 ,t 3 ) describes a bijection from P 1 (F 9 ) to E(F q ). Use this to compute 
a p n (E) for all e and to prove the recurrence. 

9.4.3. Let E and E' be isogenous elliptic curves over Q. Show that their Galois 
representations Vg(E) and Ve(E') are equivalent. Faltings's Isogeny Theorem, 
mentioned in Section 8.8, states the converse. (A hint for this exercise is at 
the end of the book.) 

9.4.4. This exercise gives an example of a mod I representation. Such repre- 
sentations will play a role in Section 9.6. 

(a) Let E be an elliptic curve over Q with conductor TV and let i be prime. 
Thus E[£] = F 2 . The mod I representation of E is 

P E ,i ■ Gq — ► Gal(Q(£M)/Q) — ► GL 2 (F,). 

Explain why for any maximal ideal p G Z over any p \ £N, the characteristic 
equation of p B ^(Frob p ) is x 2 — a p (E)x + p = 0. (Here the coefficients are 
inF/.) 

(b) Consider the elliptic curve E : y 2 = x 3 — d where d € Z is cubefree, 
and let I = 2. Show that Q(_E[2]) is the non-Abelian Galois number field 
F = Q(d 1 / 3 ,^i 3 ) from Section 9.1. Therefore the embedding (9.5) followed by 
reduction modulo 2 gives an isomorphism Gal(Q(S[2])/Q) — > GL 2 (F 2 ). Use 
this to describe a p (E) (mod 2). 

(c) Let E be any elliptic curve over Q with conductor N. Suppose that 
£7[£](Q) 7^ {0}, i.e., E has an ^-torsion point with rational coordinates. Show 
that for any maximal ideal p G Z over any p \ £N the characteristic equation 
of p £ ^(Frob p ) is (x — l)(x — p) = 0. Explain how this shows that £'[^](Q) 
cannot be all of E[£] for £ > 2, cf. the function field result from Section 7.6 
that Q(j, Ej[N]) contains fi N . (A hint for this exercise is at the end of the 
book.) 
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9.5 Galois representations and modular forms 

This section associates Galois representations to modular curves and then 
decomposes them into 2-dimensional representations associated to modular 
forms. 

Let N be a positive integer and let £ be prime. The modular curve Xi(N) 
is a projective nonsingular algebraic curve over Q. Let g denote its genus. The 
curve X\(N)c over C defined by the same equations can also be viewed as a 
compact Ricmann surface. As in Chapter 6 the Jacobian of the modular curve 
is a (/-dimensional complex torus obtained from integration modulo homology, 

Ji(JV) = Jac(X 1 (7V) c ) = S 2 (r 1 (N)) A /R 1 (X 1 (N) c , Z) = C/A g . 

The Picard group of the modular curve is the Abelian group of divisor classes 
on the points of Xi(N), 

Pic°(Xi(iV)) = Div°(Xi(N))/Div e (Xi(N)). 

By the methods of Section 7.9, Pic u (Xi(A^)) can be identified with a subgroup 
of Pic 0 (Xi(N)c) , and the complex Picard group is naturally isomorphic to 
the Jacobian by Abel's Theorem as in Section 6.1. Thus there is an inclusion 
of ^"-torsion, 

i n : Pic 0 (X!(iV))[r] — ► Pic°(Xi(JV) c )[^] = (Z/fZ) 2 ». 

Recall that Igusa's Theorem (Theorem 8.6.1) states that Xi(N) has good 
reduction at primes p \ N, so also there is a natural surjective reduction map 
Pic°(Ai(A0) — > Pic°(Ai(7V)) restricting to 

tt„ : Pic 0 (*i TOP"] — ► Pic 0 ^ (#))[*"]. 

We state without proof two generalizations of facts we have used about elliptic 
curves: 

• The inclusion i n is in fact an isomorphism. 

• So is the surjection 7r„ for p \ £N. 

These follow from results of algebraic geometry. Specifically, if a curve X 
over a field k has genus g and M is coprimc to char(k) then Pic°(A)[M] = 
(Z/MZ) 2g , and if a curve X over Q has good reduction at a prime p\M then 
the reduction map is injective on Pic (X)[M]. 
The i-adic Tate module of X\{N) is 

Ta*(Pic°(Xi(i\0)) = Hm{Pic 0 (Xi(JV))[r]}. 

n 

Similarly to the previous section, choosing bases of Pic 0 (Xi(N))[£ n ] compat- 
ibly for all n shows that 
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Ta<(Pic o (Xi(i\0)) = Z? fl - 
Any automorphism cr G Gq defines an automorphism of Div°(Xi(N)) , 

(^n P (P)) ff = Y j n P {P°)- 

Since div(/) cr = div(/ CT ) for any / e Q(Ai(7V)), the automorphism descends 
toPic°(Xi(JV)), 

Pic°(Xi(JV)) xG q ^ Pic°(Xi(JV)). (9.13) 

The field extension Q(Pic°(X 1 (7V))[r])/Q is Galois for each n e Z+, so the 
action restricts to Pic°(Xi(N))[£ n ]. For each n there is a commutative diagram 

G Q 



Aut (Pic 0 (Xi (N))[£ n ]) < Aut(Pic°(A 1 (AO)[r+ 1 ]). 

Again as in the previous section this leads to a continuous homomorphism 

Px!(N),e ■ Gq — > GL 2g (Ze) c GL 2g (Qe)- 

This is the 2g-dimensional Galois representation associated to Xi(N). 

Recall from Chapter 6 that the Hccke algebra over Z is the algebra of 
cndomorphisms of <S 2 (A(A0) generated over Z by the Hecke operators, 

T Z = Z[{T„» :n€Z+}]. 

The Hecke algebra acts on Pic°(Xi(N)), cf. the bottom rows of diagrams 
(7.18) and (7.19), 

T z x Pic°(Ai(A0) — > Pic 0 (A!(A^)). (9.14) 

Since the action is linear it restricts to ^-power torsion, and so it extends 
to Ta*(Pic°(Xi(iV))). From Section 7.9 the Hecke action is defined over Q. So 
the Galois action (9.13) and the Hecke action (9.14) on Pic (Xi(N)) commute, 
and therefore so do the two actions on Ta^Pic (Xi(N))). 

Theorem 9.5.1. Let £ be prime and let N be a positive integer. The Galois 
representation Pxi(N),t * s unramified at every prime p \ £N . For any such p 
let p C Z be any maximal ideal over p. Then Px 1 (N).i(Frob p ) satisfies the 
polynomial equation 

x 2 -T p x+(p)p = 0. 
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Similarly to how we have abbreviated T Pi * to T p on Pic°(Xi(iV)) all along, 
the last formula in the theorem omits the asterisk from the subscript of both 
Hecke operators on Ta^(Pic°(Xi(7V))). As in the previous section, the vector 
space Vi(Xi(N)) = Ta^(Pic (Xi(N))) (g> Q can be taken as the Galois repre- 
sentation rather than px t {N)i, there is a corresponding commutative diagram, 
and the theorem can be rephrased appropriately. 

Proof. Let p \ £N and let p lie over p. As in the proof of Theorem 9.4.1 there 
is a commutative diagram 

D p - — >Aut(Pic°(Xi(JV))[r]) 

G Fp >Aut(Pic 0 (Xi(AO)[^"]). 

The map down the right side is an isomorphism as explained at the beginning 
of the section. Similarly to before, I p C ker Px 1 (N),e- 

For the second part, the Eichler-Shimura Relation (Theorem 8.7.2) re- 
stricts to £ "-torsion, 

Pic 0 (Xi (AT)) [f^] >Pic°{X 1 {N))[£ n ] 

Pic 0 ( X x ( N) ) [£ n ] CTp '* +<P> * gp > Pic 0 (Xi(N))[£ n ]. 

The same diagram but with Frob p + (p)pFrob p 1 across the top row instead 
also commutes, cf. (8.15). Since the vertical arrows are isomorphisms, T p = 
Frobp + (p)pFrobp- 1 on Pic 0 (X^N) )[£ n ]. This holds for all n, so the equality 

extends to Ta £ (Pic°(A\(iV))). The result follows. □ 

To proceed from Picard groups to modular forms, consider a normalized 
cigenform 

feS 2 (N, x ). 

Recall from Chapter 6 that the Hecke algebra contains an ideal associated 
to /, the kernel of the eigenvalue map, 

// = {T € T z : Tf = 0}, 

and the Abelian variety of / is defined as 

A f = 3 1 (N)/I f 3 1 (N). 

This is a complex analytic object. We do not define an algebraic version of 
it because its role here is auxiliary. By (6.12) and Exercise 6.5.2 there is an 
isomorphism 
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Tz/If-^Of where Of = Z[{a n {f) ■ n € Z+}]. 

Under this isomorphism each Fourier coefficient a p (f) acts on Af as T p + If. 
Also, Of contains the values xi n ) f° r 71 € Z+ and x(p) acts on A/ as (p) + If. 
The ring 0/ generates the number field of /, denoted K/. The extension 
degree d = [K/ : Q] is also the dimension of Af as a complex torus. As with 
elliptic curves and modular curves, the Abclian variety has an ^-adic Tate 
module, 

Ta,(A / )=lnn{A / n} = Zf. 

n 

The action of Of on ij is defined on ^-power torsion and thus extends to an 
action on Ta,g(Af). The following lemma shows that Gq acts on Ta^(^4/) as 
well. 

Lemma 9.5.2. The map Pic 0 (Xi(N))[£ n ] — > A f [£ n ] is a surjection. Its ker- 
nel is stable under Gq. 

Proof. Multiplication by i n is surjective on the complex torus Ji(iV). This 
makes it surjective on 7/Ji(7V) as well. Indeed, any y G IfJi(N) takes the 
form y = £\ T t y, with T, G I f and Vl G Ji(iV) = £ n 3i{N) for each i, so 
V = E l T t (t n x t ) = £ n £ 2 T iXi e e n IfJi{N) as desired. 

To show the first statement of the lemma, take any y G Af\l n \. Then y = 
x + IfJi(N) for some x G Ji(AT) such that r.x G IfJi(N). Thus fi = fi' 
for some x' G IfJi(N) by the previous paragraph. The difference x — x' lies 
in Ji(iV)[r] = Pic°(Xi(7V))[£"] and maps to y as desired. 

The kernel is Pic°(Xi(7V))[r] n I f Ji(N) = (I f 3i{N))[e n ]. We claim that 
the inclusion (7 / Pic°(Xi (N) ))[l n \ C (7 / Ji(AT))[f l ] is in fact an equality. To 
sec this, let S 2 = S 2 {r 1 (N)) and 77i = Hi(A 1 (A^) c ,Z) C 5 2 A . Thus Ji(N) = 
5^/77i and 

7 / J 1 (7V) = (7 /( S 2 A + 770/77! = I f S£/(Hi n //5 2 A ). (9.15) 

Proposition 6.2.4 shows that 7/77i is a subgroup of i7i n IfS 2 with some 
finite index M. This shows that M(H\ n IfS 2 ) C If Hi. Now suppose 
that y G (7 / J 1 (iV))[f n ]. Then (9.15) shows that y = x + Hi n J/«S£ with 
a; G IfS 2 , and since £ n y = 0 this implies t n x G 77i n IfS 2 . Therefore 
Mfi G M(77r n I f S£) C 7/77!, and so x G 7 / (M- 1 £-"i7 1 ). It follows that 
y G If{3i{N)[Ml n }) C 7 / Pic°(Ai(7V)), and since £ n y = 0 the equality is 
proved. Thus the kernel is (7/ (Pic 0 {Xi(N) ))[£"]. This is stable under Gq as 
desired since the Galois and Hecke actions on Pic°(Xi(7V)) commute. □ 

So Gq acts on ^4/[^™] and therefore on T&e(Af). The action commutes 
with the action of Of since the GQ-action and the Tz-action commute 
on Ta^(Pic 0 (Xi(iV))). Choosing coordinates appropriately gives a Galois rep- 
resentation 

PA f ,l ■ Gq — ► GL 2 d(Ql)- 



390 9 Galois Representations 



This is continuous because px 1 (N),e is continuous and (Exercise 9.5.1) 

P~x\(N)A U ^9)) C p- A ) A {U{n,d)), (9.16) 

where U(n,g) = kcr (GL 2g (Z e ) — > GL 2s (Z/rZ)) and similarly for U(n,d). 
The representation is unramificd at all primes p \ £N since its kernel con- 
tains ker px-i_{N),i- For any such p let p C Z be any maximal ideal over p. 
At the level of Abelian varieties, since T p acts as a p (f) and (p) acts as x(p), 
PA f ,e(Frob p ) satisfies the polynomial equation 

x 2 - a p (f)x + x(p)p = 0. 

The Tate module Ta^( Af) has rank 2d over Z^. Since it is an ©/-module the 
tensor product Ve(Af) = Ta^(A/)(g)Q is amodule over Of®Qe — K/^qQ^. 

Lemma 9.5.3. Ve(Af) is a free module of rank 2 over K/ ®q Q^. 

Proof. Again \c\_S 2 = 5 2 (A(iV)) and_i?i = Hi(Xi(AT) c , Z) C S£. Consider 
the quotients 5^ = S£/I f S£ and i?i = (Hi + I f S£)/I f S$. Then A/ = 
S£/{Hi + I f sf) = S£/H-l. Thus F x is an ©/-module whose Z-rank is 2d. 
Since K/ is a field, i?i <E> Q is a free K/-modulc of rank 2, and therefore 
Hi ® = # i ® Q ®q is free of rank 2 over the ring K / ®q Q^. 

The O^-linear isomorphisms l~ n H\j H\ — > H\j£ n Hi induced by multi- 
plication by £ n on £~ n H\ assemble to give an isomorphism of O/tgiZf-modulcs, 

Ta £ (A/) = lim{A f [£ n }} = lim{r n H 1 /7T 1 } = \^{H 1 /£ n H 1 } = H 1 ® Z t , 

n n n 

where the transition maps in the last inverse limit are the natural projection 
maps. And now Ve(A f ) = Ta, e (Af) <g> Q = Hi <g> Z e ® Q = Hi <g> Q £ is an 
isomorphism of modules over 0/ <g) Z^ <g) Q = K/ <g> Q^, showing that Vg(Af) 
is free. □ 

The absolute Galois group Gq acts (K/ ®q Q^-linearly on Vf(A/), and 
V((Af) = (Kf <E)q Qe) 2 by the lemma. Choose a basis B of V^(.A/) to get 
a homomorphism Gq — > GL 2 (K/ ®q Q^). Also, (9.9) specializes to give 
K/ ®q = f3 A | £ K /^a, so for each A we can compose the homomorphism 
with a projection to get 

Pf,x : G Q — ► GL 2 (K/, A ). 

This is continuous (Exercise 9.5.2(b)), making it a Galois representation. And 
ker(pA f .i) C ker(p/. A ) (Exercise 9.5.2(c)). We have proved 

Theorem 9.5.4. Let f e S 2 (N, \) be a normalized eigenform with number 
field K/. Let I be prime. For each maximal ideal A of On f lying over £ there 
is a 2- dimensional Galois representation 
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Pf,X ■ Gq — > GL 2 (K/ ; a)- 

This representation is unramified at every prime p \ IN. For any such p 
let p C Z be any maximal ideal lying over p. Then p/^(Frob p ) satisfies the 
polynomial equation 

x 2 - a p (f)x + x(p)p = 0. 
In particular, if f € S2(T 0 (N)) then the relation is x 2 — a p (f)x + p = 0. 

Exercises 

9.5.1. Establish (9.16). (A hint for this exercise is at the end of the book.) 

9.5.2. (a) Let i : Kf ®q Q e — ► IIa|<^/>* be the isomorphism of (9.9). 
For each A, let e\ be the element of Kf ®q that is taken by i to 
(0, . . . , 0, 1 K/ A , 0, . . . , 0) and let V ft x = e\{V t {A f )). Show that each V ft x is 
a 2-dimensional vector space over K/,a and that 

Ve(Af) = @Vf, x . 
x\e 

Show that each Vf t \ is invariant under the Gcj-action on Ve(Af). Show that 
if each Vf t \ is given the basis e\B over Kf t \ where B is the basis of Ve(Af) 
over Kf ®q in the section then each pf t \ is defined by the action of Gq 
on Vf.\. (Hints for this exercise are at the end of the book.) 

(b) To show that pf t \ is continuous it suffices to show that the action 

Vf,x x Gq — > Vf,x 

is continuous. Explain why this statement is independent of whether Vf.x is 
viewed as a vector space over Kf^x or over Q^. Explain why the action is 
continuous in the latter case. 

(c) Use the decomposition from (a) to show that ker(pA f ,e) C ker(p/ ) x) for 
each A. 

9.6 Galois representations and Modularity 

This last section states the Modularity Theorem in terms of Galois represen- 
tations, connects it to the arithmetic versions in Chapter 8, and describes how 
the modularity of elliptic curves is part of a broader conjecture. Finally we 
discuss how the modularity of Galois representations and of mod I represen- 
tations are related. 

Definition 9.6.1. An irreducible Galois representation 

p : G Q — ► GL 2 (Q,) 

such that detp = \e * s modular if there exists a newform f £ S2(T 0 (Mf)) 
such that Kf_\ = Qi for some maximal ideal A of Ok s lying over I and such 
that pf yX ~ P- 
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In particular, if E is an elliptic curve over Q and I is prime then the Galois 
representation pej is a candidate to be modular since it is irreducible and its 
determinant is xe by the proof of Theorem 9.4.1. 

Theorem 9.6.2 (Modularity Theorem, Version R). Let E be an elliptic 
curve over Q. Then p E j is modular for some I. 

This is the version that was proved, for semistablc curves in [Wil95] and 
[TW95] and then for all curves in [BCDT01]. We will explain how Version R 
of Modularity leads to Version a p from Chapter 8, which in turn implies a 
stronger Version R, 

Theorem 9.6.3 (Modularity Theorem, strong Version R). Let E be 

an elliptic curve over Q with conductor N. Then for some newform f € 
S2(r n (N)) with number field K/ = Q, 

Pf,i~PE,e for all £. 

Given Version R of Modularity, let E be an elliptic curve over Q with con- 
ductor N. Then there exists a newform / € ^(/^(M/)) as i n Definition 9.6.1, 
so pf,\ <~ pej for some suitable maximal ideal A of its number field K /. Thus 
PE,e(Frob p ) satisfies the polynomial x 2 — a p (f)x + p for any absolute Frobe- 
nius clement Frob p where p lies over any prime p\ £Mf, since pf_\ does. But 
the characteristic polynomial of p£,^(Frob p ) for any Frob p where p \ £N is 
x 2 — a p (E)x + p. Therefore a p (f) = a p (E) for all but finitely many p. The 
work of Carayol mentioned at the end of Chapter 8 shows equality for all p 
and shows that Mf — N. This is Version a p . 

On the other hand, given Version a p of Modularity, again let E be an el- 
liptic curve over Q with conductor N. There exists a newform / G S2{Tq{N)) 
such that a p {f) = a p (E) for all p. Since a p {f) <E Z for all p, the number 
field of / is Q and the Abclian variety Af is an elliptic curve. Consider 
the representations pfj — PA,,i and pE,e for any I. The respective charac- 
teristic polynomials of p/j(Frob p ) and p E ,e(F*ob p ) are x 2 — a p (f)x + p and 
x 2 — a p (E)X + p for all but finitely many p. Thus the characteristic polyno- 
mials are equal on a dense subset of Gq , and since trace and determinant are 
continuous this makes the characteristic polynomials equal. Consequently the 
representations are equivalent (Exercise 9.6.1). Since I is arbitrary, this is the 
strong Version R. 

The reasoning from Version R of Modularity to Version a p and then back 
to the strong Version R (or see Exercise 9.6.2) proves 

Proposition 9.6.4. Let E be an elliptic curve over Q. Then 

if Pe,£ is modular for some £ then pe.i is modular for all I. 

Theorem 9.5.4 generalizes to a result that Galois representations are asso- 
ciated to modular forms of weights other than 2. 
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Theorem 9.6.5. Let f e Sk(N,x) be a normalized eigenform with number 
field K^. Let I be prime. For each maximal ideal A o/Okj lying over £ there 
is an irreducible 2-dimensional Galois representation 

P/ lA :G Q — ►GL 2 (K / , A ). 

This representation is unramified at all primes p \ IN . For any p C Z lying 
over such p, the characteristic equation of pf y \(Frob p ) is 

x 2 -a p (f)x + X (p)p k - 1 = 0. 

The characteristic equation shows that 

detp/,A = xx\~ Y 

where the Dirichlet character x is being identified with the Galois representa- 
tion p x from Section 9.3 (Exercise 9.6.3). It follows that the representation pf t \ 
satisfies 

det p/^(conj) = — 1 

where as before, conj denotes complex conjugation. Indeed, x(conj) = x{~ 1) 
as noted in Section 9.3, and necessarily x(— 1) = (— l) k for Sk(N,x) to be 
nontrivial. Since Xf( con .j) = — 1 from Section 9.3 as well, the result follows. In 
general, a Galois representation p such that det p(conj) = — 1 is called odd. 

Section 9.5 constructed pf.\ when k = 2 but didn't prove that pf^\ is irre- 
ducible or that the equation is the characteristic equation. The construction 
for k > 2, due to Dclignc [Dcl71], is similar but uses more sophisticated ma- 
chinery. The dual of Sk(Fi(N)) contains a lattice L that is stable under the 
action of Tz and such that L ®q admits a compatible action of Gq. To 
define the Galois action and generalize the Eichler-Shimura Relation, Dclignc 
used etale cohomology. The construction for k = 1, due to Deligne and Serre 
[DS74], is different. It uses congruences between / and modular forms of higher 
weight, and it produces a single representation with finite image, 

Pf : Gq — > GL 2 (K/), 

that gives rise to all the Pf,\- By embedding Kj in C we can view pf as a 
complex representation Gq — > GL2(C), a phenomenon unique to k = 1. For 
example the representation p from the end of Section 9.1 is pg x . 

Theorem 9.5.4 also generalizes to Eisenstein series and reducible represen- 
tations. Recall the Eisenstein series Ef' v € £k(N,x) from Chapter 4, where 
ip and ip are primitive Dirichlet characters modulo u and v where uv | N and 
tpip — x at level N. The Fourier coefficients given in Theorem 4.5.1 for n > 1 
are 

a n (E^) = 2at^(n), where af^n) = ^ ^{n/m) V {m)m k -\ 

m\n 
m>0 
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Recall also the Eisenstein scries 

tfl>,<p,t/ ) = i E t' V ( tT ) unlcss k = 2,ip = <p = l, 

k (T> \El> x (T)-tEl^(tT) if k = 2, V = <p= 1. 

Here t is a positive integer and tuv \ N. These form a basis of £k{N,x) as 
the triples {ip, ip, t) run through the elements of a set A N ^ k such that ip<p = x, 
cf. Theorem 4.5.2, Theorem 4.6.2, and Theorem 4.8.1. By Proposition 5.2.3, 
E^ ,v,t for such a triple is an eigenform if uv = N or if k = 2 and ip = (p = 1 
and i is prime and JV is a power of t. 

Theorem 9.6.6. Let f = \E^' ip ' t E £k(N,x) where E^'^'* is an eigenform as 
just described. Let £ be prime. For each maximal ideal A o/Okj lying over I, 
the reducible 2-dimensional Galois representation 



ip 0 

o vxf 



is unramified at all primes p \ IN . For any p C Z lying over such p, the 
characteristic equation of pf^\(Froh v ) is 



x 



■ 2 -a p (f)x + X (p)p k - 1 = 0. 



To prove this, again recall from Section 9.3 that any primitive Dirich- 
let character <p modulo N acts as a Galois representation unramified at the 
primes p not dividing its conductor, and it takes Frobp to <f>(p) for such p. 
Thus if also p \ I then det p/^Frobp) = tp(p)(p(p)p k ^ 1 = x(p)p k ~ 1 as desired. 

Also trp />A (Frob p ) = ip{p) + ip{p)p k ^ 1 = crf-iiP)- If * = 1 thcn this is a p(f)- 
In the exceptional case when k = 2 and tp = (p = 1 and £ > 1 it is <Ji(p), 
which is (ii(p) — tai{p/t) since p\ N = t e . Again this is a p (f). 

All reducible representations that plausibly arise from Theorem 9.6.6 in- 
deed do so. 

Theorem 9.6.7. Let the Galois representation p : Gq — > GL2(L) be odd, 
reducible, and semisimple, i.e. p ~ q " . If ip> has finite image and det p = 

XX k (~ X where \ has finite image then for some eigenform f = ^E^' tp ' t with 
ip<p — x as described before Theorem 9.6.6, 



Pf.x ^ P- 

Here we arc using the result from Section 9.3 that 1-dimensional Galois 
representations with finite image can be viewed as Dirichlet characters. 

A modularity result is conjectured for irreducible representations as well. 
This requires slightly broadening the definition of a modular representation. 
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Definition 9.6.8. Consider a Galois representation 

p:G Q ^GL 2 (L). 

Suppose that p is irreducible and odd and that det p = Xx\~ Y where x has 
finite image. Then p is modular if there exists a newform f e Sk(Mf,x) 
such that for some maximal ideal A of C K/ lying over I, K f_\ embeds in L 
and pf,\ <~ p. 

The Fontaine Mazur-Langlands Conjecture [FM93] is 

Conjecture 9.6.9. Let p : Gq — > GL 2 (L) be irreducible, odd, and geomet- 
ric. Then XiP is modular for some integer r. 

The exact definition of a geometric Galois representation is beyond our 
scope. Modular representations are geometric, and Conjecture 9.6.9 states 
a converse. Besides representations pej, and the representations in Theo- 
rem 9.6.7, the geometric representations also include the representations of Gq 
with finite image. The statement that any irreducible, odd, 2-dimensional Ga- 
lois representation with finite image is modular is a version of Artin's Con- 
jecture. In this case k = 1 and L can be replaced by a number field in Def- 
inition 9.6.8. The result was proved for solvable finite image by Langlands 
[Lan80] and Tunnell [Tun81]. In the case of projectively dihedral represen- 
tations the relevant weight 1 eigenform giving rise to p is a theta function 
already considered by Hccke [Hec26], generalizing the theta function 9 X from 
Section 4.11 that gives rise to the representation with image D3 at the end of 
Section 9.1. The only possible nonsolvable finite image is projectively the alter- 
nating group A 5 . Modularity in this case was proved by Buzzard, Dickinson, 
Shcpherd-Barron, and Taylor [BDSBT01] [Tay03] under technical hypotheses, 
building on ideas in Wilcs's proof. But in general Conjecture 9.6.9 is still quite 
open. 

Mod I representations as in Exercise 9.4.4 play a prominent role in the 
proofs of Format's Last Theorem and Modularity. Let / e Si{T\{M f)) be a 
newform and let A <G Ok/ he over £. By Proposition 9.3.5 we may assume up 
to similarity that the representation pj\ maps to GL 2 (0 K/ ,a)- So it has a 
mod I reduction 

Pf,x ■ Gq — ► GL 2 (0 K/ ,a/AO K/ ,a). 

More generally we consider mod I representations p : Gq — > GL 2 (F£) where 
has the discrete topology and p is continuous. Since Gq is compact this 
means that the image is finite and therefore lies in GL 2 (F£r) for some r. The 
notion of modularity applies to mod i representations as well. 

Definition 9.6.10. An irreducible representation p : Gq — > GL 2 (Fe) is 
modular of level M if there exists a newform f £ 5 2 (A(-^)) and a maximal 
ideal A C Ok/ lying over £ such that pf X ~ P- 
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A modularity conjecture for mod I representations is due to Serre [Ser87]. 
Its formulation uses a recipe giving a minimal level M(p) in terms of the 
ramification of p. 

Conjecture 9.6.11. Let p : Gq — > GL 2 (F£) be irreducible and odd. Then p 
is modular of level M(p). 

Excepting some cases with t = 2 it is known, building on work of Ribet 
[Rib90], that 

ifp is modular then p is modular of level M(p). 

The relation of this to Format's Last Theorem is that a nontrivial solution to 
the Fermat equation 

a e + b e + c e = 0 
would give rise to the corresponding Frey curve 

E F : y 2 = x(x-a e )(x + b e ). 

The Fermat equation reduces to the case gcd(a, b, c) = 1, a = — 1 (mod 4), and 
b even. The Frey curve is then semistable. Also, the number field Q(£jr[^]) is 
then unramified outside 2 and £, and even the ramification there is unusually 
small, making M(p Ef1 ) — 2. But <S 2 (r 0 (2)) = {0} since the corresponding 
modular curve has genus 0, cf. Figure 3.4. Thus p EF t is not modular and so 
Pe f ,i is n °t modular. 

On the other hand, [Wil95] begins with any elliptic curve E over Q and 
considers the mod 3 representation 

Pe,3 '■ Gq — ► GL 2 (F 3 ). 

There is an embedding i : GL 2 (F 3 ) — > GL 2 (Ok) where K is a num- 
ber field (Exercise 9.6.4(a)), and GL 2 (F3) is solvable. So if p B 3 is irre- 
ducible then the Langlands-Tunnell result shows that the 3-adic represen- 
tation i o p E 3 arises from a weight 1 newform. One can show that therefore 
p E 3 arises from a weight 2 newform, making it modular in the sense of Def- 
inition 9.6.10 (Exercise 9.6.4(b-h)). After this the core of the argument in 
[Wil95] and [TW95] shows that under technical hypotheses on a representa- 
tion p : Gq — > GL 2 (Z^) with mod I reduction p, 

ifp is modular then p is modular. (9-17) 

The technical hypotheses apply when E is semistable and p = Pe,3- This all 
assumes that p E 3 is irreducible, but if it isn't then the proof argues that p E 5 
can be used as a starting point instead. Thus Wiles and Taylor-Wiles show 
that for any semistable E one of pe,3 or p E ^ is modular. Since the Frey curve 
is semistable this combines with Ribct's result to contradict Proposition 9.6.4, 
and thus the Fermat equation has no nontrivial solution. The contribution of 
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[BCDT01] to Modularity is to relax the technical hypotheses so that p E3 
or p E 5 meets them even when E is not semistable. 

The argument using the Langlands-Tunnell result to show that ~p E 3 is 
modular applies to any p taking values in GL 2 (F 2 ) or GL 2 (F 3 ). Conjec- 
ture 9.6.11 is also known under technical hypotheses for representations 
in GL 2 (F£r) for £ r = 4,5,7,9, but in general it is still quite open. Rama- 
krishna [Ram02] has shown that mod £ representations lift to geometric £-adic 
representations. With techniques now available to prove implications of the 
form (9.17), this means that the Fontaine-Mazur-Langlands Conjecture and 
Serre's Conjecture are closely related. 

Exercises 

9.6.1. Consider two representations p, p' : Gq — > GL 2 (Q£) with p irreducible 
and detp(conj) = —1. Assume that p and p' have the same characteristic 
polynomial. Prove that p and p' are equivalent as follows. 

(a) Show that p(conj) has distinct eigenvalues in Q^. After conjugating if 
necessary we may assume that p(conj) = //(conj) = [J _?]• (Hints for this 
exercise are at the end of the book.) 

(b) Show that there is some a £ Gq such that p{a) = [ a c b d ] with b 
and c nonzero. Write p'(cr) — ^,]. Use the relations tr p(a) = trp'(a), 
trp(conja) = tr p' (conj a) , and det p(a) = detp'(a) to show that a = a', 
d = dl , and be = b'c' ^ 0. Show that after a further conjugation if necessary, 
p(a)=p'(a). 

(c) For arbitrary r € Gq show that p(r) = p'(t) by applying the equality 
tr p = tr p' to t, conj t, err, and conj err. 

9.6.2. The proof of Proposition 9.6.4 does not require proceeding via Ver- 
sion a p of Modularity and Carayol's work. If p E .e is modular for some I then 
there exists a newform / e 5 2 (r 0 (M/)) such that K^>, = for some A 
over I and such that <~ p E ^. Explain why therefore a p (f) = a p (E) for 
almost all p. As explained in Section 8.8, Strong Multiplicity One now shows 
that Kf = Q, so the Galois representations associated to / take the form 
Pf,e ■ Gq — > GL 2 (Qf) for every I. Use Exercise 9.6.1 to show that therefore 
Pf,e ~ Pe,£, i.e., pej, is modular for all I. 

9.6.3. (a) Let pf \ be the representation in Theorem 9.6.5. Use the charac- 
teristic equation of pf ^(Frobp) for p\lN to show that det pf t \ = XXi~ X ■ 

(b) Conversely, suppose that the theorem stated instead that detp/^ = 
XXi~ X an d that pf^\(Frobp) satisfies the displayed equation for all primes 
p \ £N. Show that the equation is the characteristic equation of pf : \(Frob p ) 
for such p. 

9.6.4. This last exercise carries out the first part of the argument in [Wil95]. 
(a) It is known that the projective group PGL 2 (F3) is isomorphic to the 

symmetric group S4. Show that the map 
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"-1 1 

-1 0 

defines an embedding i : GL 2 (F 3 ) — > GL 2 (ZN/^2]). Note that Z[V^2] is the 
number ring Ok where K = Q(v / — 2). Show that the ideal A = (1 + \/— 2) is a 
maximal ideal of Ok lying over 3 such that the residue field kA is isomorphic 
to F 3 . Show that following i by reduction modulo A gives the identity. (Hints 
for this exercise are at the end of the book.) 

(b) Let E be an elliptic curve over Q and let / e S\(Mf, if>) be the new- 
form provided by the Langlands-Tunnell result such that pj t \ ~ i o p E3 . By 
Proposition 9.3.5 we may assume that p/ t A maps Gq to GL2(Ok)- Show that 
the reduction p~f \ of pf.\ modulo A is ~p E $- Show that a p (f) = a p {E) (mod A) 
for all but finitely many p. Show that ip is the quadratic character with con- 
ductor 3, so that it makes sense to write ip(p) = p (mod A) for all p. 

(c) Recall from Section 4.8 the weight 1 Eisenstein series Ef' 1 £ Mi(3,tp). 
Show that 

00 

n=l 

where a n G 3Z for all n > 1. Let g = SEf' 1 /. Show that g lies in £2(^0 (3-M/)) 
and that / = g (mod A), meaning a n (f) = a n (g) (mod A) for all n. 

(d) Although / and SEf' 1 are eigenforms their product g need not be. 
Nonetheless, show that T p g = a p (E)g (mod A) for all but finitely many p and 
Tg = ai(Tg)g (mod A) for all T € Tz- Deduce that the map 

(j, : T z — > F 3 , T i-> oi(T fl ) (mod A) 

is a homomorphism taking T p to a p (E) (mod 3) and (p) to 1 for all but finitely 
many primes p. 

(e) We want to obtain an eigenform g' e £2(^0 (3M/)) that gives rise 
to p E 3 . This requires using ideas from Chapter 6 and commutative algebra 
from Chapter 8. Let m — ker <j> and let P be a minimal prime ideal of Tz 
contained in m, cf. Exercise 8.5.6. Since T z is a finitely generated Z-module, 
Tz/P is an integral domain that is finitely generated as a Z-module. And 
since Tz is free over Z (cf. Exercise 6.5.1) no rational prime p is a zero-divisor 
in Tz, so no rational prime is contained in P. Therefore Tz/P contains Z, 
i.e., it is of characteristic 0. Its field of quotients is thus a number field K' 
and Tz/P is contained in its ring of integers Ok' - This argument shows that 
there is a homomorphism 

0':T Z ^O K <. 

A maximal ideal A' of Ok' containing <fr'(m) exists by Lemma 8.5.5 applied to 
the localization of Tz/P at m/P or by the lying-over theorem in commutative 
algebra. Show that <j>'(T p ) = a p (E) (mod A') and 4>'{{p)) = l (mod A') for all 
but finitely many p. 

(f) Before continuing we need to show that the natural surjection 





1 1 
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-1 




1 V 


-1 0 








1 y 


7 


1 


1 



9.6 Galois representations and Modularity 399 



7T : T z <8> C — > T C) 51 T « 0 z « ^ S ZjT * 

i i 

is an isomorphism. Reread the proof of Theorem 6.5.4. Note that the vec- 
tor space V in the proof is Hi (X\ ( N) , Z) <g> C. Since the action of Tz 
on Hi(Xi(AT),Z) is faithful, the action of T z <£> C on V is faithful as well. 
Explain how this shows that (using the notation in the proof) Tz <8> C acts 
faithfully on S 2 © S£ , 

(J2 T *® z *) (/' <P) = ( E ^ T ^' V ° S z ^) • 

i it 

Use this to show that 7r is injective. 

(g) Using the identification Tz (£> C = Tc and the containment Ok' we 
may extend <fr' to a homomorphism <fi' : Tc — 5- C given by 4>' Ti ® Zj) = 

Zift (Ti) . Convince yourself that the pairing in the proof of Proposi- 
tion 6.6.4 gives rise to a corresponding eigenform g' e 52(A(3M/)) with 
coefficients in Ok' such that 

cj>'(T)=a 1 (Tg'), T E T z . 

Show that dp(g') = a p (E) (mod A') and Xg'(p) = 1 (mod A') (where Xg' is the 
character of 5', cf. Exercise 5.8.1) for all but finitely many primes p. Parts (e) 
and (g) of this exercise are the Deligne-Serre Lifting Lemma. 

(h) Let g" be the newform associated to g' by Proposition 5.8.4. Show 
that Kg// C K g / and that a p (g") = a p (E) (mod A") for all but finitely many 
primes p, where A" = K s » HA'. Use the argument in Exercise 9.6.1 to conclude 
that ~p g i, yi <~ p E 3 . Hence p E 3 is modular in the sense of Definition 9.6.10. 



Hints and Answers to the Exercises 



Chapter 1 

1.1.4. (b) For \S\ < 1, \t + S\ > B = B ■ sup{l, \5\] for all t e ft. For 
I < \5\ < 3A and Im(r) > A, |r + £| > A > \5\/3 = sup{l, |<5|}/3. For 
1 < H — and -B < Im(r) < A, the quantity |r + 5|/|<5| takes a nonzero 
minimum m, so |t + <5 > m\S\ = m ■ sup{l, \5\}. For \5\ > 3A \t + S\ > 
\6\ — A> 2\S\/3 = 2sup{l, \6\}/3. So any positive C less than inf{S,l/3,m} 
works. For (c), break the sum into 2£(fc) + X) c ^o d \ CT + ■ -^ or c 0, 
|cr + rf| = |c||t + <5| where 5 = <i/c. Apply (b) and then (a). Since any compact 
subset of H sits inside a suitable i?, holomorphy follows from a theorem of 
complex analysis. 

1.1.5. For the first formula, take the logarithmic derivative of the product 
expansion sin-7rr = 7rrn^Li(l — r 2 /n 2 ). The second formula follows from the 
definitions of sin and cos in terms of the complex exponential. 

1.1.6. (d) To show h(0) = 0, simply antidifferentiate. To show h(m) = 0 for 
m < 0 replace r by — r and reason as in part (c) except now the rectangle no 
longer goes around the singularity at the origin. 

1.2.2. (a) Let g = gcd(c, d) and note that g is relatively prime to N. If c ^ 0, 
set s = 0, and use the Chinese Remainder Theorem to find t congruent to 1 
modulo primes p \ g and congruent to 0 modulo primes p \ g, p \ c. If c = 0 
then d 0 (unless N = 1, in which case the whole problem is trivial) and a 
similar argument works. 

1.2.3. (a) One way is by induction on e. For e = 1, |GL2(Z/pZ)| is the 
number of bases of (Z/pZ) 2 , and SL 2 (Z/pZ) is the kernel of the surjective 
determinant map to (Z/pZ)*. For the induction, count ker(SL 2 (Z/p e+1 Z) — > 
SL 2 (Z/p e Z)). The map surjects since SL 2 (Z) — > SL 2 (Z/p e Z) does. For (b), 
SL 2 (Z/WZ) J]* SL 2 (Z/p^*Z) where N = UiPV> h V the Chinese Remainder 
Theorem. 
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1.2.6. (b) Since (f[a]k)(T+N) — f[a]k(r) we may estimate |/[a]fc| as qn — > 0 
letting y — > oo and assuming 0 < x < N where r — x + iy. So |ct + rf| grows 
as y and |(/[a] fc )(r)| = |/(a(T))||cT + d|- fc is bounded by (C 0 + C/Im(a(r)) r ) • 
|cr + d\- k , or (C 0 + C\ct + d\ 2r /y r )\cT + d\~ k . We may take r > 1 and thus 
absorb the first term into the second. The result follows. 

1.2.8. (b) For the inverse, note E 2 (t) — S 2 (7(7 _1 (t))). For (e), note that 
for 7 G r 0 (N), Nj(t) = j'(Nt) for a certain 7' G SL 2 (Z). Use the Fourier 
series and Proposition 1.2.4 to check the holomorphy condition. 

1.2.9. See Exercise 1.1.7(b) for the relevant £- values. 

1.2.10. (a) The sum is — S^Li Em=i q nm /m. The sum of absolute values 
is J2 m J2n\l\ nm / m = Em M"V( m (l _ M™)); an d tnis converges since its 
terms are bounded by e - 27Tmlm ( T ) as to — > 00. The convergence is uniform 
on compact sets because any compact subset of H has a point with minimal 
imaginary part. 

1.2.11. (b) If c ^ 0 then write a/c = a'/c' with a',d G Z, gcd(a',c') = 1. 
Now left multiplying by an SL 2 (Z) matrix with top row (c',— a') clears out 
the lower left entry. For the second part, if f[a]k has period h then f[y}k has 
period dh. 

1.3.2. Show (1) (3) (2) (1). 

1.3.3. (c) The entire exercise is set in one E, so multiplying by d takes points 
P + AtodP + A, not to dP + dA. 

1.4.1. (c) The integrals along opposing pairs of boundary edges cancel down 
to l/(27ri)(w 2 Jo" 1 f'(z)dz/f(z)-u 1 J^ 2 f'(z)dz/f(z)). Each of these integrals 
is J dlog/(z) along a path with equal /-values at the endpoints, so log/ 
changes by an integer multiple of 2iri along each path. For the second part, if 
f(z) = {z- x)^g(z) where g(x) ^ 0 then zf'(z)/f(z) - v x (f)z/{z - x) + 
zg'(z)/g(z) has residue v x {f)x at x, so the Residue Theorem gives the result. 

1.4.3. For A^ show that p([i3z) = /i3p(z) and p(z) — p(z). If p(z) = 0 
then also p(^z) — 0, so fi 3 z = ±z (mod A) since p is even and has only two 
zeros. The minus sign can not arise because it would imply z = — z (mod A) 
but p is nonzero at the order 2 points. The zeros are z = (1 + 2^ 3 )/3 and 
z= (2 + M3 )/3. 

1.5.1. For r(N), take any point [E, (P,Q)] of S(iV). After applying an iso- 
morphism (which preserves the Weil pairing by Exercise 1.3.3(d)) we may 
assume E = C/A T ,, P = (ar' + b)/N + A T >, Q = (ct' + d)/N + A T , with 
a,b,c,d G Z/NZ and t' G H. Since P and Q generate E[N] and have Weil 
pairing e N (P, Q) = e 2lxi l N ', [ a c b d ] G SL 2 (Z/iVZ). Lift this matrix back to some 
7 G SL 2 (Z), let r = 7(7-') and to = ct' + d, so that mr = ar' + b, and show 
that [E,{P,Q)\ = [C/A t ,{t/N + A T ,1/N + A T )\. 
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1.5.2. The map S(N) — * Si(JV) takes ip-\r(N)T) to ^(AWr). Com- 
pute this first for points in the form [C/A T , (t/N + A T , 1/N + A r )], and then 
show that in general the map works out to [E, (P, Q)] i-> [E, Q\. 

1.5.3. Each b (mod N) e Z/NZ acts on the modular curve Y(N) as 
b (mod N) : r(N)r ^ r(N)-f(r) where 7 = [J f ] S A(iV). The correspond- 
ing action on the moduli space S(N) under ip -1 is therefore 

b (mod AT) : [E T , (t/N + A r ,l/N + A T )\ 

^ [E y(T) , (7(r)/JV + vl 7(T) , 1/JV + A^))]. 

By the methods of the section, letting m = j(7, r) shows that the image 
is [E T ,[m^[r)/N + A T ,m/N + A T )]. Since 7 = [J J], m = 1 and this is 
[E T , ((r + 6)/AT + /l r , 1/AT + yl T )], showing that the map is 

6 (mod N) : [E, (P, Q)} ^ [E, (P + bQ, Q)]. 

Argue similarly for d mod A^ e (Z/NZ)* acting on Si (AT), this time letting 
7 = [ « Jj] e r 0 (N) for suitable a, fc, AT. 

1.5.4. Note that multiplying /l_i/(Ar r ) by r gives the lattice rZ ffi (l/N)Z. 
The map is [S, C] ^ [E/C, E[N]/C\. 

1.5.6. (a) Recall the condition C n (Q) = {0 S }. If p { A" then the bottom 
row of the matrix is not always (0, 1). 

Chapter 2 

2.1.3. (a) Let y 1 = inf{Im(r) : r G [/(}, Fj = sup{Im(T) : r G U[}, 
y 2 = inf{Im(r) : r e all positive. Then for 7 = e SL 2 (Z) and 
t £ f/j, the formula Im(7(r)) = Im(r)/|cr + d\ 2 shows that Im(7(r)) < 
min{l/(c 2 ?/i), Yi/(cRc(r) + d) 2 }. The first of these is less than y 2 for all but 
finitely many values of c; for each exceptional c, the second is less than y 2 
uniformly in r for all but finitely many values of d. 

2.2.5. Both 7r and tp are open and continuous. 

2.3.4. (b) For part (b), Z[7] is ring isomorphic to Z[i]. For part (a), if 7 has 
order 3 then —7 has order 6. 

2.3.5. (a) The matrices fixing ^ 3 are [ _% a b _ b ], (2a - b) 2 + 3b 2 = 4. 

2.3.7. (c) It suffices to show the result for r 0 (p), p=-l (mod 12). If [ a c b d ] 
fixes a point then a + d € {0, ±1} and ad = 1 (mod p). If a + d = 0 then this 
means a 2 = — 1 (mod p) , impossible by the nature of p (mod 4) . The other 
two cases are similar with p (mod 3) . 
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2.3.8. Show that also SL 2 (Z) = \J j -fj 1 {±I}r. Given r e U, some 7 in 
some 7~ 1 {±J}7 n transforms r into D. 

2.4.4. (a) r(N) C SrS- 1 for some N e Z+. (b) Note that SL 2 (Z) = 
5SL 2 (Z)5-\ 

2.4.6. See Corollary 2.3.4, recall that 1111(7(7-)) = Im(r)/|cr+ d| 2 for 7 = 
[jj] £ SL 2 (Z), and note that if r e {i, p 3 } then |cr + rf| > 1 for all nonzero 
integer pairs (c, d) . 

Chapter 3 

3.1.1. Let {±I}r 2 = ^{±1}^^. Then f~ l {r 2 T) = {AT,?-} and these are 
distinct when r is not an elliptic point for _T 2 . 

3.1.2. Show that for 7 G T 2 , the indices [{±/}r , 2 . 7(T) : {±/}r li7(r) ] and 
[{±/}i\ T : {±/}A,r] are equal. 

3.1.4. (b) The cusps are 0 and 00, cf. Figure 3.1. For (c), 70^ (i) = aj(i) if and 
only if aJ^-^/aj^) — i, and by Corollary 2.3.4 the nontrivial transformations 
in SL 2 (Z)j are ± [ 1 ~o] ■ Similarly for (d). The data to compute g in this and 
the next two problems are tabulated in Figure H.l. 



r 


d 


£2 


£3 


£"oo 


r 0 ( P ) 


P + l 


1 if p = 2 
2 ifp = 1 (4) 
0 ifp = 3 (4) 


1 if p = 3 
2 if p = 1 (3) 
0ifp = 2 (3) 


2 


A (2) = r 0 (2) 


3 


1 


0 


2 


A(3) 


4 


0 


1 


2 


A(P), P>3 


2 


0 


0 


p-1 


r(2) 


6 


0 


0 


3 


rip), p>2 


p(p"-i) 
2 


0 


0 


P 2 -i 
2 



Figure H.l. Data to compute the genus 



3.1.5. (b) For p > 3 the answer is g = 1 + (p - l)(p - ll)/24. 

3.1.6. The cusp 00 has ramification degree p under 7r : X(p) — > SL 2 (Z)\'H*, 
and since T(p) is normal in SL 2 (Z) so do all the other cusps, cf. Exer- 
cise 2.4.4(c). The number of cusps follows from the degree of the map 
and the number of cusps of SL 2 (Z)\"H*. For p > 2 the answer is g = 
l + (p 2 -l)(p-6)/24. 
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3.2.2. Let A = 240 °3(n)g™ and B = -504£~ =1 a 5 (n)q n . Cite Ex- 
ercise 1.1.5 to show that (60£ 4 (t)) 3 = (64tt 12 /27)(1 + A) 3 and A(t) = 
(64tt 12 /27)((1 + A) 3 - (1 + B) 2 ). Consequently j(r) = 1728(1 + A) 3 /(3A + 
3A 2 + A 3 -2B-B 2 ). The numerator lies in 1728(1 + qZ[[q]]), where the double 
brackets denote power series. In the denominator 3A 2 and A 3 and B 2 lie in 
1728g 2 Z[[q]], and since d 3 = d 5 (mod 12) for all integers d, also 3A - 2B lies 
in 1728g(l + gZ[[g]]). 

3.2.3. We may assume r = SL 2 (Z). For weight-2 SL 2 (Z)-invariance use 
the chain rule relation (j o 7)'(r) = j' (l( T ))l' ( T )- Also remember to check 
meromorphy at oo. Show that (j') k ^ 2 € .4fc(SL 2 (Z)) for positive even k. 

3.2.5. The group is the matrices ± [ J \] such that ±a [ J ^] a" 1 e A (4). 

3.5.3. For the last part show that for all k e Z, multiplying by the cusp 
form Z\ defines an isomorphism .Mfc(SL2(Z)) — >• 5fc + i 2 (SL 2 (Z)). 

3.5.5. Recall Exercise 3.5.4(b) with TV = 11. 

3.7.1. (a) Suppose 7 £ SL 2 (Z) has order 4 or 6 and fixes the point r £ H, 
and suppose 7" 1 = a-ya^ 1 with a £ GL^(Q). Then 7 _1 fixes a(r), so a fixes 
r since a takes H to H and the other fixed point of 7 -1 is f. Thus a £ (7) 
and 7 _1 = 7, contradiction. For (c), recall Proposition 2.3.3. 

3.7.4. The bottom entry of u 0 7 I = (al + 07) ] works out to cc 7 x + (a + 
crf 7 )y. Use information about c 7 and c? 7 to show that this is 0 (mod N). 

3.7.5. (b) For period 2, the ring A = Z[i] is a principal ideal domain and its 
maximal ideals are 

• for each prime p = 1 (mod 4) , two ideals J p — (a + hi) and J p = (a — bi) 
such that (p) — JpJp and the quotients A/J^ and A/J p are isomorphic to 
Z/p e Z for all e e N, 

• for each prime p = — 1 (mod 4), the ideal J p = (p) such that the quotient 
A/ Jp is isomorphic to (Z/p e Z) 2 for all e £ N, 

• for p = 2, the ideal J 2 = (1 + i) such that (2) = J| and the quotient 
A/JI is isomorphic to (Z/2 e / 2 Z) 2 for even e G N and is isomorphic to 

Z / 2 (e+l)/2 Z 0 Z / 2 (e-l)/2 Z for odd e g ]y_ 

3.7.6. (a) This is another elementary number theory problem. Use the Chi- 
nese Remainder Theorem to reduce to prime power N, and then see what 
happens when you try to lift solutions modulo primes p to solutions modulo 
powers p e . 

3.8.1. (a) The numerator and denominator linearly combine back to a and 
c under 7 _1 . 

3.8.2. (b) Since the summand (N/ d)cf)(d)<p(N / d) is multiplicative in d, the 
sum is multiplicative in N, so it suffices to take N = p e . 
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3.8.7. Since r(4) is normal in SL 2 (Z), and since the cusp oo is regular, all 
cusps are regular. See Exercise 5 of Section 3.2 for the argument that s — 1/2 
is an irregular cusp of A (4). Checking the cusp s = 0 is similar. 

3.9.1. Since the calculations involve only one value of TV such that —I belongs 
to the groups, only one irregular cusp, and only one nonzero value of e 2 , the 
formulas for even k and odd k usually agree. 



Chapter 4 

4.2.4. (b) Since v is a point of order N, gcd(gcd(c„, d v ), N) = 1. 

4.3.1. If x is nontrivial then replace n by n^n in the sum for some value no 
such that x( n o) 0- The second relation is proved similarly since if n ^ 1 in 
(Z/NZ)* then the proof of duality shows that x( n ) 1 f° r some character \. 

4.3.4. (a) This is a standard result from representation theory. To do it 
without citing anything, for each character \ '■ (Z/NZ)* — > C* , define the 
operator 

rv ' de(z/NZ)* 

on Aik(ri(N)). Show that ir x = ir x , so ir x is a projection; show that 
TT x {M k {ri(N))) C M k (N,x) and that 7r x = 1 on M k (N,x), so the pro- 
jection is on M k (N, x). Show that ^2 n x = 1 and that ir x o tt x ' — 0 when 
X 7^ x', so the subspaces M k (N, x) span and are linearly disjoint. 

4.4.1. For (b), change variables to get a Gaussian integral. For (c), integrate 
by parts. 

4.4.2. Compute that 

{-iTXif _ 257T fe (-l) fe / 2 

2r(k) ~ (1 • 3- -- (fc- 1))(1 - 2- • •(! - 1)) 



4.4.5. For (c), Exercise 4.4.4 and fact that ((s) has a simple pole at s = 
1 with residue 1 show that L(l N ,s) <~ <fi(N)/(N(s — 1)) as s -> 1. And 
(1 + (-l)- s )/(s - 1) = (e- ms - e~ ni )/{s - 1) is a difference quotient for the 
derivative of e^ %s at s = 1. For (d), compute that 

N-l oo 

£ x(n- 1 )i(l,%)= E tfn" 1 ) £ XHE^ 

(m,AT) = l 



4yy ,1 J y xk 1 ^). 

N ^ ^ m/N + d ^ y y " 

d m X^K 
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The inner sum is </>(TV) — 1 if m = n (mod TV) and —1 otherwise. The middle 
sum is taken over <p(N) values of m, so now 



N-l 



E X (n- 1 )i(l,x) = ^E E 



d=0 m=l 
(m,iV)=l 



n/N + d m/N + d 



where in this calculation the equivalence class n (mod TV) and its representa- 
tive in {1, . . . , TV — 1} are identified. Replacing n by — n and its representative 
by TV — n gives 



1 oo JV-l 

- E x((-n)- 1 )i(l,x) = ^E E 



i=0 m=l 
(m,JV)=l 



+ 



n/N-d-l m/N + d) ' 



Thus the sum — ]T (x^" 1 ) - x((-n)" 1 ))i(l, X ) in C"(l) is 



w E 



TV^ V n/N + d n/N-d-l 



This is 
TV ^ 



d=0 



1 



1 



1 



1 



n/TV + d+1 n/N-d-l n/N + d n/N + d+l 



1 

TV 



.d=0 



^\n/N + d + l ' n/N-d-l 



i/N 



1 ,77^ 
— cot ( — 
TV V TV' 



4.4.6. Use the expression for r in the exercise and switch to polar coordinates 
to get 



/•oo / i7r /2 

r(a)r(b)=4 e- r ' 2 r^ a+b -^ r dr < 
Jr=0 Je=o 



cos 211 - 1 9 sin 2 *- 1 9 d6. 



Let x = cos 2 0. 

4.5.3. (c) For each character ip modulo d where d | TV, Lemma 4.3.2 shows 
that half of the characters tp modulo N/d combine with ip to meet the parity 
condition unless N/d = 1 or N/d — 2. For the exceptional values d = N and 
d = TV/2 (if TV is even), for each character tp modulo N/d, Lemma 4.3.2 shows 
that half of the characters ip modulo d combine with ip to meet the parity 
condition unless d = 1 or d = 2. Thus \B N . k \ = (1/2) £ 4>(d)4>(N / d) for TV \ 4, 
as required by formula (4.3). For the exceptional cases, count that \B N ^ k \ for 
even k is 1 when TV = 1, 2 when TV = 2, and 3 when TV = 4, and that |-Bjv,fc| 
for odd A: is 0 when TV = 1 or TV = 2, and 2 when TV = 4. Again these match 
formula (4.3). 
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4.7.3. Cite the Monotone Convergence Theorem from real analysis, the fact 
that absolute integrability implies intcgrability, and the Dominated Conver- 
gence Theorem from real analysis. 

4.7.5. (c) Let z — ee ld and bound the absolute value of the integral by a 
quantity of the order l^ 15 " 1 !- 

4.8.2. This is very similar to Exercise 1.4.1, but integrate Za over t + dP 
instead. 

4.8.7. For (b) See Figure 3.4 and Theorem 3.6.1. For (c), .Mi (A (4)) = 
C Ef' 1 ' 1 where \ is the nontrivial character of (Z/4Z)*. A basis of A4 3 (Ti(4)) 
is {E^\E^ xA }. A basis of M 4 (A(4)) is {El' 1 ' 1 , E^ 1 ' 2 , £ 4 1,M }. Here all 
three basis elements contribute to 6(t,8). 

4.9.1. f(x) is uniformly approximated within e by an integral over a compact 
subset K of R/. In the integral f(x + 5x) - f(x), note that e -^(y,x+Sx) _ 
e -2™{ y ,x) = e -2^(y^)^ e -2MvM) _ i) ; an d if fix is small enough then the 
quantity in parentheses is uniformly small as y runs through K. 

4.9.3. (b) The square of the integral is JJ x e~ n ^ x2+y2 ^dx dy. Change to 
polar coordinates. 

4.9.4. The Fourier transform is f(x) = f°Z 0 y s ~ 1 e 27Tly ( T ~ x ' , dy. Replace y 
by z/{— 2m(r — x)), going from 0 to oo along a ray in the right half plane. 
The resulting integral equals a gamma function integral by complex contour 
integration. 

4.9.5. (b) r(s) is the Mcllin transform of e - '. Use part (a). 

4.10.2. Compute that 

/ f(yjr)e- 2 ^y^dy= f f{y)e- 2 ^~ 1 ' r ^d{y 1 - 1 /r) 

JyER 2 JyeR 2 

= r- 2 f f(y)e- 27Il ^- x ^ T / r Uy. 
J yen 2 

4.10.3. For nonzero x € G, note that ^' £]S ' > ^ 1 for j = 1 or j = 
2 where the ej are the standard basis vectors, and that J2 V Mtv'^ = 

(x,e^S) -c-^ (x,vS) • n 

MN Isv^N ' S0 tne SUm 1S °' 

4.10.5. (a) For the first part, show that ^2 u a(u)b(u) = ^2 v a(—v)b(v) m 
general. For the second part, replace b by b. 

4.10.6. (b) For k < 0, since fk{x) = f _ k (x) and in general Tp(x) = <f(—x) 
(show this), it follows from f_ k = (—i)~ k .f-k that f k = (—i) k fk as before. 
Similarly since f_ k {xS) = (— i)~ k f-k(x), also fk(xS) = (—i) k fk(x) as before. 
Note that hkixt 1 / 2 ^ h k (x)t^ k ^ 2 , that \n-f\-^- 2s = y- k / 2+s /\ct + d\- k+2s , 
and that hk(n-f) = h^ k (nj) = y k/2 /{ct + d) k . 
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4.10.8. For (a), recall formula (4.11). For (c), show that g(ip)g(ip) — ip(—l)u 
similarly to the calculation after (4.11) and recall that (tp(p)(— 1) = (— l) k . 



Chapter 5 

5.1.2. Take integers N u N 2 such that r(iVj) C G t for i = 1,2, and let 
N 3 = lcm(N 1 ,N 2 ); use the fact that each [d : r(N 3 )} is finite. 

5.1.3. Suppose that /3 and /?' represent the same orbit in ri\r\ar 2 , i.e., 
_Ti/3 = -Ti/3'. Letting (3 = 710172 and j3' = "/[ct"/ 2 translates this condition into 
a l2 € A<*72- Since / is weight- A: invariant under 7^, it quickly follows that 

fWk = fWh- 

5.1.4. Set h = lcm({hj}) where each gj has period hj. 

5.2.5. (c) If uv = N then £ = 1 and part (b) holds for all p. Suppose p \ N; if 
£ I n then use part (b), if £ + n but £ | np then p | f | iV, contradiction and hence 
this case can't arise, and if £ \ np then there is nothing to check. For n = 0 
there is nothing to check unless = 1\. For (d), a n (E) — a\(n)—tai(n/t) and 
a n (T p E) = o-i(np) - l N (p)p(Ji(n/p) - t{a\{np/t) - ljv(p)po-i(n/(£p))) when 
n > 1. If p \ N then use parts (b) and (c). If p \ N then the assumption is 
t = p, N = p* . Verify the result for p j n and for p | n. Also verify the result 
for n = 0. 

5.2.7. (a) In the desired equality ((t + j) /p)Z ® Z = ((t + j) /p)Z + tZ © Z 
one containment is obvious, and for the other note that r = p(r+j)/p—j. The 
groups (Npr + p)Ayn ™j( pT ) an d ^Ipr are equal by an argument in Section 1.5, 

and multiplying through by 1/p gives the second desired equality of groups. 

5.2.8. Recall the proof of Lemma 2.3.1 and Lemma 2.3.2. 

5.3.1. Checking n — p is straightforward. For e > 1 show that M pC -iM p = 
M r if p I N and M pC -iM p = M r U M pC - 2 if p \ N, and the result 
follows for n = p e . When gcd(m,n) = 1, M mn isn't quite M m M n but since 
[J \ ] € r 0 (N) the difference doesn't affect the weight-fc operator. 

5.4.1. (a) Start by writing x = (r + t)/2, y = (t — t)/2z, and recall that in 
the algebra of differential forms, drdr = df dr = 0 and dfdr = —drdr. 

5.4.3. If the unions SL 2 (Z) = {J({±I}r) ai , {±I}r = {J({±I}r')f3j are 
disjoint then so is SL 2 (Z) = U({±4 r ')/3j«*- 

5.4.4. Let SL 2 (Z) = \J j rfc so that also SL 2 (Z) = } r'a^j. 
5.5.1. (c) Use Proposition 5.5.2(a) for the first part. 
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5.6.3. (a) Add a parenthesized subscript to the Hecke operators to denote 
level. Checking the diagram reduces to showing that for /, g e S k (r 1 (Np^ 1 )), 
(1) T {Np -i)f = T {N) f and (2) {T( Np -x)g)[a p \ k = T {N) (g[a p ] k ). 

For T = (d), show that if 7 e Jo(JV) has bottom right entry d then (1) 
since also 7 € .To(Ap _1 ) it follows that (<i)(Ar p -i) is (d)(N) restricted to level 
Np~ x ; and (2) since a p -fa p 1 G Jo(ATp _1 ) has bottom right entry d as well, it 



follows that (d)(ATp-i) is [a p ]k • (d)(jv) 



For T = T p i, show that (1) 7] 



(composing left to right). 
) is Tp',(jv) restricted to level Ap~ 



and (2) if 5 e S k (Np 1 1 x) for some character \ ■ {Z/Np X Z)* — ► C*, then 
g[a p ]A; e S k (N,x) where x is now lifted to (Z/NZ)*. Use Proposition 5.2.2(b) 
to show (Tp/^jvp-ijff^apjfc = T p , ^ N )(g[a p ] k ) . 



1 j 
0 p 



-Ejs[[s;: 



1 j 
0 p 



(b) Going down and across takes (/, gr) to • /[ 
(this relies onp \ N); at level Np~ x the first sum is T„f—((p)f)[a p ] k regardless 
of whether p \ Np' 1 ; the second sum is X^g[[op] [0 i]] fc wm ch is p k ~ 1 g. 
Going across and then down gives the same expression. 

(e) Show that w (N) f = (w( Np -i)f)[a p ]k and w (N) (g[a p ] k ) = p k ~ 2 ■ 
and now checking the diagram is straightforward. 



-in 



(Np-i)9, 



5.7.2. If 72 



a (i 
7 S 



G 7d then det 72 = 1 and 72 = [ J f ] (mod A) and 



/3 = ZciV/d for some fcgZ. Write k = qd+ b, 0 < b < d, and compute 



72 



1 



a (3 
7 (5 - 



- abN/d 
jbN/d 



call 

= 7i- 



Then det 71 = 1, and (3 - abN/d = (1 - a)bN/d (mod A) = 0 (mod A), 
and 5 - jbN/d = 1 (mod N), so 71 = I (mod A). Thus 71 € r(A) and 
72 e r(A) [ J bw / d ]. For uniqueness, the coset J\A) [J fc ^ /d ] has its upper 
right entry (mod A) determined by b (mod d). 



5.7.4. If 7T is a projection then so is 1 — n. If 7Ti and 7r 2 are commuting 
projections then ker(7Ti7r 2 ) = ker(7Ti) + ker(7r 2 ) (one containment is clear; 
for the other, write x = y + z where y = 772(0;) and z — x — y and show 
y e ker(TTi), z e ker(7r 2 )). If n is a projection then ir e im(7r) if and only if 
n(x) = x (if x = w(y) then ir(x) — 7r 2 (y) = w(y) = x), and it follows that 
kcr(l — 7r) = im(7r). 

5.7.5. For example, the right side summand in Theorem 5.7.5 is 

s k (r\N/ P )) = s k (r(N)) rl (N/p)/r{N) 

= 5 fe (r(A))n^^ 1 (p?)/ r (P?')xr 1 ( P :-- 1 )/r( P :o 
= S k (r(N))nj# H i x < Ht ' Kt) by Lemma 5.7.6 

= s k (r(N))i H ' K *>. 



5.7.6. Setting V — S k (r(N)) isn't enough since the proposition requires V 
to be irreducible. 
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5.8.3. (b) Proposition 5.8.5 describes the Fourier coefficients of /, with \ 
the trivial character modulo 11. Formula (5.4) describes the T p -action on 
<S2(-To(88)) but here x is the trivial character modulo 88. 

5.8.4. By the methods early in the section, if ai(f) = 0 then / — 0 and if 
ai(f) ^ 0 then normalizing to ai(/) = 1 gives T n f — a„(/)/ for all n e Z + . 
Let / = g + h with g old and h new. Applying T n gives a n (f)f = T n g + T n h, 
and since T n preserves the decomposition of Sk(ri(N)) as a direct sum of old 
and new subspaces necessarily T n g = a n (f)g and T n h = a n (f)h. Similarly g 
and h are (n)-eigenforms for all n £ Z + . Thus g and h are eigenforms with 
T„-eigenvalues a n (f). If h — 0 then / = g is old. If /i ^ 0 then again by the 
methods early in the section a\{h) ^ 0 and T n h = (a n (h) / 'a\{h))h, showing 
that a n (f) = a n {h)/a\{h) and thus / = h/a\{h) is new. 

5.9.1. (a) See the more general argument in Section 5.4. 

5.9.2. First consider the product over a finite set of primes and the sum over 
n divisible only by these primes. 

5.11.2. For (a), let T*f — a p f and compute that 

%></,/) = (T;.fJ) = (f,T p f) = a p {f,f). 

For (b), if bp = a p for all p then Proposition 5.8.5 shows that Ef' v /2 — f is 
constant, making it the zero function, but this violates the linear disjointness 
of Sk(ri(N)) and £k(ri(N)). For (c), compute similarly to (a) that 

a p (Efj) = (Et' v ,T;f) = (T p Et",f) - b^J). 
For (d), use Proposition 5.5.2(a) and then Exercise 5.4.3. 

Chapter 6 

6.1.3. If dcg(/) = 1 then there is no ramification. 

6.1.4. By Proposition 6.1.4 ip(z + A g ) = mz + A for some row vector m € C 9 
such that mAg C A, and m ^ 0 since ip surjects. Thus ker(</?) takes the 
form V + A g where V C C 9 is a vector subspace of dimension g — 1. On 
the other hand im(/) contains J^ g for all x e X 0 (N). This includes 0 Jo (jv) so 
Oe € im((p o /), and span(im(/)) = Jo(-ZV) by Abel's Theorem so im(/) can't 
be a subset of ker(iys). This makes ip o / a nonconstant holomorphic map of 
compact Riemann surfaces, therefore a surjection. 

6.2.2. (a) The result is clear on the subset Y' of Y defined later in the section. 
Since norm/,/ extends continuously to Y as a function to C it is meromorphic. 



412 9 Galois Representations 



6.3.1. (a) Regardless of whether the weight-A; operator in general is defined 
by (/[o;]fe)(r) = (dct a) e j(a, r)~ k f(a(r)) with e = k — 1 or with e = k/2, 
the exponent is e = 1 for k = 2. The diagram says that g , (a(r))rf(a(r)) = 
(<7[ Q! ]2)(t)c£t for g G ^(/V), and this is easy to check for any g : % — > C. 

6.4.1. By definition of r as a resultant, r(u) = 0 if and only if there exists 
some t such that q(t, u) — 0 and q(t) = 0, and by definition of q as a resultant, 
there exists some t such that q(t, u) = 0 if and only if there exists some s such 
that p(s) = 0 and u = s + 1. Thus r(u) = 0 if and only if there exist s and t 
such that p(s) = 0, q(t) = 0, and u = s + t. In particular, r(a + (5) = 0. 

6.4.3. If a e Z n Q then a = s/t with gcd(s, t) = 1 and t n p(a) = 0 for some 
monic polynomial p with integer coefficients. This last relation implies t \ s, 
so t = 1. The other containment is clear. 

6.4.4. Each algebraic number a satisfies a polynomial x n + (ci/d)x™ _1 + 
• • ' + (cn/d) with C\, . . . , c„, d G Z. Consider cP<x 

6.5.2. For each d G (Z/iVZ)* take two primes p and p' both congruent to 
modulo N. Use formula (5.10) with r = 2 to express x(d) in terms of a p (f), 

<V(/)i v(/) 5 and v 2 (/)- 

6.5.3. For the first isomorphism, if </? G (M/JM) A then the map y> : 
m i y ip(m + JM) is an element of M A [J], and if ip € M A [J] then the 
map ip : m + JM i-> ■(/'(to) is a well defined clement of (M/JM) A . Show 
that ip ^ (p and ip invert each other and that cither of them is a Tc- 
modulc homomorphism. The second isomorphism follows from the first since 
a finite-dimensional vector space is naturally isomorphic to its double dual. 
In particular, if <p + JM A G M A /JM A then the restriction (p = </?|m[j] is the 
corresponding element of M[J] A . 

6.6.5. (a) Taking ip G ii(N) and omitting cosets from the notation, ° 
T»(r) - rup((T p r) o n) while ((a p (f) o VjMW) = rvp{T P W ° «))• 
These are the same by Section 5.6 since p{ TV. 

(b) Stack (6.19) on (6.20), show that the outer rectangle commutes, com- 
bine this with the top square commuting and isogenics surjecting to show that 
the bottom square commutes. 

Chapter 7 

7.1.4. (b) Substitute the relation y = \x+n into (7.1) to get a cubic equation 
4x 3 — X 2 x 2 + • • • = 0. Show that the roots are x P , xq, and r as in (7.3). Also 
letting s be as in (7.3), the sum P + Q = (r,s) agrees with (7.2). 
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7.2.3. (a) Since (p is a combination ^ fnpi where the fi are polynomials, use 
the product rule and then the fact that (fii(P) = 0 for all i. (c) If D 2 E(P) ^ 0 
then (0,1) ^ T p (£), so T p {£) is spanned by some (a, b) with a ^ 0. Thus 
x—xp+m 2 p can serve as the dual basis under the pairing, (d) One containment 
is clear. For the other, suppose s £ rap D Mp. Then s — r/t where r £ mp 
and t £ k[C] — mp. Since mp is maximal, 1 — tv £ mp for some v £ k[C]. 
But s = s(l — tv) + rv, showing that s £ mp. 

7.3.2. Renotate {vp} as {v\, . . . ,Vn} and use induction on N. The case 
N = 1 is clear since is a subspace of codimension 1 . For the induction step 
suppose some a^-i satisfies apj^i -Vi ^ 0 for i = 1, . . . , N — 1. If a^r-i - i>Ar 7^ 0 
then there is nothing to show. Otherwise take some w such that u ■ vn =/= 0 
and consider vectors a at = u — fcajv-i- 

7.3.4. (a) Let F^y) - (F(x,y)-F(x,-y))/(2y) andF 2 (x,y) = + 
— y))/2. These are both invariant under y ^ —y, making them functions 

of x. 

7.5.1. (c) First evaluate the limit termwise and then convince yourself that 
doing so is justified. 

7.5.3. (c) For the first part, f 0 (r(N)oo) = 3jv( ^~ 1) while (/ 0 o 7 )(r(iV)oo) 
is strictly smaller if 7 ^ r 0 (N). Geometrically, as Im(r) — > +00 so that 
j — > 00, the universal elliptic curve is degenerating to the singular curve 
y 2 = Ax 3, — 27x — 27, whose singular point (—3/2, 0) is an isolated point of the 
curve's real points. All iV-torsion points coming from complex torus points 
(ct + d)/N with c ^ 0 (mod N) go to the isolated point as Im(r) — > +00, 
but the A/'-torsion points coming from complex torus points d/N stay on the 
other real piece. Section 8.1 will discuss singular curves such as this one. For 
the second part, the formula before the display shows that jm is 7-invariant if 
and only if j is 7'-invariant. Clearly 7' has rational entries and determinant 1. 
If 7' £ SL 2 (Z) then j is 7'-invariant. If 7' ^ SL 2 (Z) then it identifies points 
that are incongruent under SL 2 (Z), because any r £% such that 7 '(t) = 8{t) 
for some 5 £ SL 2 (Z) satisfies a quadratic equation over Q. But j takes a 
different value at each point of SL 2 (Z)\"H, so it is not 7'-invariant. Thus jjv is 
7-invariant if and only if 7' £ SL 2 (Z), and it follows quickly from the display 
that this condition is 7 £ r 0 (N). 

7.5.4. First compute j for the curve y 2 = 4x 3 — 5 2 (t)x — 33 (r) obtained from 
the map (p, p'). Since the curve Ej differs from this curve by an admissible 
change of variable it has the same invariant. 

7.7.1. Since jjy = j(Ej/ (Qt)) an d since / 0 = x((Q T )) is the sum of the finite 
x-coordinates of (Q T ), it follows that = j(Ej / ' {Q%)) and f£ = x{{Q%)) for 
o £ Hq. In both cases the results obtained over C show that the fixing 
subgroup is the subgroup that preserves (Q T ), i-e., Q% — dQ T for some d. 
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7.8.1. (b) Since £(P) = 1 and L(P) contains k, L(P) is no more than k. Thus 
there is no function with a simple pole at P. Any set of elements with distinct 
valuations at P is linearly independent, by properties of the valuation. Thus 
the linear relation at the end must involve at least one of X 3 and Y 2 since 
the other five elements arc linearly independent. Take the coefficient of X 3 to 
be 1 and the coefficient of Y 2 to be a. Substitute aX and aY for X and Y 
and then divide by a 3 . 

7.8.2. (a) The group law is (a,P)(a',P') = (aa',a'(P) + P'). Check that 
(a,P)((a',P')f) = ((a,P)(a',P'))f. 

(b) The semidirect product acts on constant functions in K as H since 
translating the variable has no effect. Showing that K n 1 — k' in fact shows 
K n k' = k' by the nature of the rest of the configuration. 

(d) Write the restriction of i as \{E) HkC — > 1(E) H . 

7.8.4. tp o ip o tp = tp o [dcg(tp)] = [deg(tp)] o tp since p is a homomorphism. 
Since ip surjects it cancels on the right, giving the result. 

7.8.5. Take a point Q £ E' and any point P £ E such that p(P) = Q. Then 
at the level of divisors, i>*((Q) — {Oe 1 )) = ([deg(</?)]P) — (Oe) while since tp is 
unramified, 

p*((q)-(o E '))= J2 ( p + R )- E w- 

Rekcr(ip) Reker(ip) 

These are not equal, but their difference satisfies the characterization of prin- 
cipal divisors in Theorem 7.3.3 since deg(^) = |ker(y>)|, so their classes are 
equal in Pic°(E). 

7.9.2. (a) w N : X 0 (N) — ► X 0 (N) given by J T 0 (iV)r ^ r 0 {N)r' is a holo- 
morphic map of compact Riemann surfaces. By Section 7.3 it can be viewed 
instead as a morphism over C of algebraic curves over C. Its pullback is 
w* N : C(X 0 (N)) — > C(X 0 (N)), a C-injection of function fields over C. By 
Section 7.5 this is w* N : C(j(r), jjv(r)) — ► C(j(r), jjv(r)), taking j(r) to 
j(r') and j N (r) to ijv(r'). If j(r') and j N (r') are in Q(j(r), jjv(r)) then it 
restricts to w* N : Q(j(r), j'at(t)) — > Q(j(r), jAr(r)), a Q-injection of function 
fields over Q. This is w* N : Q(X 0 (iV) alg ) — > Q{X 0 (N) alg ) by Section 7.7, and 
now Theorem 7.2.6 gives wn '■ Xq^N)^ — > Xo(N) ll i g , a morphism over Q 
of algebraic curves over Q. For the last part, compute that j(r') — Jn{t) and 
3n{t') =j(r). 

(b) A point of X 0 (A)Pg nar is (j, x) £ Q 2 such that p 0 (j, x) = 0. The map 
is [E, C] i y (j(E),x(C)) where x(C) is the sum of the x-coordinates of the 
nonzero points of C . For the second part, recall Exercise 1.5.4. 

7.9.3. (b) The full extensions have the same Abelian Galois group. The 
upper and lower Galois extensions on the left inject into the ones on the right, 
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so the extension degrees must match. Now use the fact that the Galois group 
is cyclic. 

(e) Compute 

h{ pT y = x {Q T ,, N y = x{Q%, tN ) = x(<p(Q r , Np y) = x(<p°(C£ tNp )) 

= x(±Lp(Q T , Np )) = /i(pr), 
the second-to-last step using parts (b) and (d) to show that ip{Q° Np ) = 

<P(Qt,N P )- 

7.9.4. Consider the map [N] : E — > E and recall that the structure 
of Ec[N] was established in Chapter 1. 



Chapter 8 
8.1.1. (b) Consider the matrices 

u 2 0 r 
su 2 u 3 t . 
0 0 1 

For (c), the condition implies v 3 = w 2 . Take u = w/v. For (e), take r = —3/2, 
s = i3V2/2, t = 0, and u = i3\/2. 

8.1.5. (a) If char(k) ^ 2 then we may take a x = a 3 = 0 and so P — (x,0) 
where x is a repeated root of a cubic polynomial over k. If char(k) = 2 and 
ai 7^ 0 then x — a^a^ and y — a^ l {x 2 + 04). If char(k) = 2 and a\ — 0 then 
use the fact that every element of k is a square. 

8.2.1. For uniqueness, the q — 1 nonzero elements of such a subfield satisfy 
xi- 1 = 1. 

8.3.1. Recall that 1728Z\ = c\ - c§. 

8.3.2. (a) Since A' p = A/u 12 and v p (A' p ) < v p {A) it follows that v p {u p ) > 0. 
If v p (u p ) — 0 then we may take u p = 1 and r p = s p = t p = 0, so now assume 
v p (u p ) > 0. Since v p (di) > 0 and v p (al i ) > 0 the formula p = (ai + 2s p )/u p 
shows that v p (s p ) > 0. (Note that forp = 2 this requires ^2(^2) > 0.) Similarly 
the formula a' 2 p — (a 2 — s p ai + 3r p — s 2 )/u 2 shows that v p (r p ) > 0, and 
a' 3 = (03 + r p ai + 2tp)/u p shows that v p {t p ) > 0. 

(c) For each p | A let r p = p ep m p /n p . Then v p (r — r p ) = v p (n p r — p ep m p ) 
for r e Z. The congruence n p r = p ep m p (mod p 6 "p( t '') has a solution 
r (mod p 6 "p( u )), and thus v p {r — r p ) > 6v p (u). The Chinese Remainder The- 
orem gives an integer r simultaneously satisfying the condition for all primes 
p I A. 
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8.3.3. (b) If the a» and the a- are integral then so are the bi and the b\. The 
relation b' 2 — b 2 = 12r shows that v p (r) > 0 for all primes p except possibly 

2 and 3. If 1^3 (r) < 0 then the relation ^3(63 — fee) > 0 is impossible, and 
similarly for 2 and feg — fe 8 . 

(c) The relation ±a[ — a\ = 2s shows that v v (s) > 0 for all primes p 
except possibly 2, and the relations 1*2 (s) < 0 and — a 2 = — sai + 3r — s 2 
are incompatible. Similarly for t, using the relations between ±a' 3 and 03, and 
a' & and a 6 . 

8.3.5. Show that a p (E) = 0 (mod p) if and only if \E{Y p e)\ ^ 0 (mod p) for 
all e > 1. By Theorem 8.1.2 this holds if and only if E(F p e) n £[p] = {0 B } for 
all e > 1, and this in turn holds if and only if E[p] = {0}. 

8.3.6. (a) The discriminant is — 2 4 3 3 and C4 = 0. Recall Proposition 8.1.3. 

(b) If x 3 = 1 and x 7^ 1 then x 2 + x + 1 = 0, impossible in F p when 
p = 2 (mod 3) by Quadratic Reciprocity. So as x runs through F p so does 
x — 1, giving one point (a;, y) on the curve for each y € F p . Remember as 
well. 

(c) For the first displayed equality, remember 0e- For the last part, re- 
member that / is cubic. 

(e) The reduction is ordinary for p = 1 (mod 4), supersingular for p = 

3 (mod 4). 

8.4.1. (b) For the first part, (3 — a for some a <G Z, this a satisfies some 
monic polynomial / G Z[x], and so [3 satisfies the monic polynomial g = f G 
(Z/pZ)[x] obtained by reducing the coefficients of / modulo p, 

»(/?) = /(a) = /(S)=% = 0 Z/p . 

For the second part, since f(x) = Y\(x — ca) in Z[x] it follows that g(x) = 
IKs-Oi) in (Z/P)N- 

8.4.3. Since we don't have a valuation on Q, one method is to work in the 
number field K generated by the Weierstrass coefficients and the change of 
variable parameters, arguing as in the proof of Lemma 8.4.1 that this field has 
a valuation and then continuing as in Exercise 8.3.3(b,c). Another method is 
to use the lemma itself as follows. The conditions u £ Z^ and u 2 b' 2 — &2 = 12r 
show that r e Z( P ) unless p lies over 2 or 3. If r ^ Z( p ) then 1/r e pZ( P ) by 
the lemma and the union Z( p ) = Z^-j UpZ( p ). If p lies over 3 then the relation 

u%-b 6 = (2b i /r 2 + b 2 /r + 4)r 3 

gives a contradiction since the left side lies in Z( p ) and the first factor on the 
right side lies in Z^-,, but r 3 ^ Z( p ). If p lies over 2 then use the relation 

u% -b s = (3b 6 /r 3 + 3b A /r 2 + b 2 /r + 3)r 4 . 
Sore Z( p ). Argue similarly for s and t. 
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8.4.4. (c) Recall Exercise 8.3.6(e). 

8.5.1. / = - 1), {W ~ !)^-}>- 

8.5.2. (c) Part (b) applies with i — 0. 
8.5.4. /(o) = (xi + pxi,) works. 

8.5.6. (c) If yz e J and y ^ J then z e Anns(a;y), so (J, z) C Anns(xy), 
contradicting maximality unless z e J. 

8.5.7. (a) Let C be the curve and Cj the nonempty afhne piece. Then P e 
C — Ci if and only if (xi/xj)(P) = 0 for some j ^ i, but each Xi/xj (where 
Xj is not identically 0 on C) has a finite set of zeros. 

(b) This is immediate from (a) since the projective curve is infinite. 

8.6.1. Taking [Ej, Q] across and then down gives (j, x(Q)) and then (j, x(Q)). 
Taking it down and then across gives [Ej, Q] and then (j(Ej), x(Q)). These 
are the same. 

8.7.1. Let E' = E/C and Q' = Q + C. Let ip : E — > E' be the quotient 
isogeny, so that Q' = f(Q). Properties of degree and the calculation ker(<p) C 
ker([p]g) = E\p\ = {0} combine to show that <p = i o <j p where i : E a v — > E' 
is an isomorphism taking Q a " to Q 1 . The first equality follows. The second 
is shown similarly using the dual isogeny tp, citing Proposition 8.4.4(b), and 
applying o-~ 1 to the coefficients of the resulting isomorphism i in this case. 

8.7.2. (a) For the first diagram, going across and then down takes [Ej,Q] 
to [P; p ,Q CT f] and then to (j(Pj p ), x(Q CT f )), while going down and then 
across takes it to (j,x(Q)) and then to (j (Tp , x(Q) (Tp ). For the second dia- 
gram, going across and then down takes [Ej, Q] to p[E° p , Q a p ] and then to 
p(j(E/ ),x(Q^ )), while going down and then across takes it to (j,x(Q)) 

and then to p(j a p ,x(Q) a p ), cf. (8.15). In both cases the results are the same. 

(b) There is a birational equivalence h from Xi(7V) planar to X\(N) as 
described in Theorem 8.6.1. Consider the diagram 



and recall formula (8.17). The map across the bottom row descends to Picard 
groups. The other diagram is similar by formula (8.19). 

(c) Set up a cube diagram as in the section, but with (d) and (d)* across 

the top rows and with the reductions (d) and across the bottom rows. 



Div°(Xi(iV)P lanar ) 



*Div°(Xi(A0P lanar ) 



h 



Div°(Xi(JV)) 



>Div°(Xi(iV)) 
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Thus the bottom square is diagram (8.39). Explain why all the other squares 
of the cube commute and why the map from the top front left to the bottom 
front left surjects. Complete the argument. 

(d) Combine the second diagram from part (b) with diagram (8.39) to get 
a commutative diagram 

Div°(5i(JV)') ^^Div°(5i(A0') -^->Div°(Si(J\0') 



Pic 0 (Xi (iV)) ^Pic°(X!(iV)) ^Pic°(Xi(AT)). 

Along with the first diagram from part (b) this gives (8.34). 

8.8.1. (a) Pic°(X 0 (N)c) is generated by the image of X 0 (N)c, and the third 
stage of Pc is an isomorphism. 



Chapter 9 

9.1.1. The extension Q(d 1 ^ 3 )/Q is not Galois but the extensions F/Q(d 1 / 3 ), 
F/Q(^ 3 ), and Q(^3)/Q are. Recall from Exercise 8.3.6(b) that if p = 
2 (mod 3) then every a is a cube modulo p. 

9.2.1. (b) A basis of the product topology is the subsets S = J\ n S n where 
S n = Z/£ n Z for all but finitely many n. Each C n is the subgroup of compatible 
elements, naturally isomorphic to Z/£ n Z. 

9.2.2. Any nonidentity m € GL<j(C) that is close enough to I takes the form 
m = exp(a) where a e IVLj(C) is nonzero, and m™ = exp(na) for all tieZ. 

9.2.3. (a) Since A lies over £ it follows that X nex nZ = £ n 'Z for some n'. The 
condition £ m £ \ nex is Y[\ \ mex C X nex , and the unique factorization of ideals 
in Ok shows that this holds if and only if m > n. Thus n' = n. So the map 
Z — > 0 K /X nex has kernel £ n Z, making Z/£ n Z — > 0 K /X nex an injection for 
all n and A. This gives an injection Zg — > Ok, a for all A. 

(b) The injection surjects if |0 K /A" eA | = £ n for all n, i.e., |O k /A| Ea = £, 
i.e., C\f\ = 1. 

9.3.1. Let U be any open normal subgroup. Then U (F) C U for some Galois 
number field F, giving a surjection Gal(F/Q) = G Q /U(F) — > Gq/U. This 
shows that Gq/U = Gal(F'/Q) for some F' C F, and so U = U(F'). 

9.3.2. Consider any neighborhood U = U a (F) in Gq. We want to find some 
Frobp e U. This holds if Frob p |F = c|f. But <t|f takes the form Frob PF for 
some maximal ideal of Of by Theorem 9.1.2. Lift pp to a maximal ideal p 
of Z. 
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9.3.4. Take a neighborhood V of I in GLd(C) containing no nontrivial sub- 
group, cf. Exercise 9.2.2. Let U = As a neighborhood of 1 in Gq, U 
contains U(F) for some Galois number field F. So p is defined on Gal(F/Q). 

9.3.5. Let M = lcm(AT, N'), so p\q^ m \ can be viewed via the isomor- 
phism (9.1) as a character \ of (Z/MZ)* that factors through (Z/NZ)* 
and (Z/N'Z)* . Since x(n mod M) is trivial if n = 1 (mod AT) and if 
n = 1 (mod AT') it follows that x is defined modulo gcd(AT, AT'). 

9.3.7. (a) L d is a <i-dimensional topological vector space over L. The groups 
GLd(L) and Aut(L d ) are naturally identified. Since p is continuous so is its 
composition with vector-by-matrix multiplication, 

L d x G Q — > L d , (v, a) i y vp(a). 

Thus p makes L d a GQ-module satisfying the continuity condition of Def- 
inition 9.3.4. On the other hand, any choice of ordered basis of V identi- 
fies Aut(V) with GLd(L), and then the map Gq — > Aut(V) gives a map 
p : Gq — ► GLd(L). Specifically, the matrix entries are p(cr)ij — Xj(ei<r) 
where the ordered basis is (ei,...,e<i) and Xi : V — > L is *Yl, L a L e L i-> a^. 
Each pij is a continuous function, making p a Galois representation as in 
Definition 9.3.2. 

(b) Equivalent representations p and p' as in Definition 9.3.2 determine 
GQ-linear isomorphic GQ-module structures of L d , and if V and V are equiv- 
alent as in Definition 9.3.4 then any choice of ordered bases B and B' de- 
termines equivalent representations as in Definition 9.3.2. Also, going from p 
to L d and then choosing a basis B gives a representation p' equivalent to p, 
while on the other hand starting with V and then choosing a basis to define p 
gives L d a GQ-module structure equivalent to V. 

9.4.3. The isogeny E — > E' induces a map V e (E) — ► V e (E'). The map 
is an isomorphism since the dual isogeny induces a similar map in the other 
direction and the composite is multiplication by the degree of the isogeny, an 
automorphism because Qe has characteristic 0. 

9.4.4. (c) Since ^-torsion contains an ^-cyclic subgroup of points with rational 
coordinates, p E e <~ [057] where xi '■ Gq — ► F 2 is the mod 2 reduction of 
the cyclotomic character. Alternatively, the f-cyclic subgroup of points with 
rational coordinates implies that t \ \E(Ft)\, i.e., a p (E) = p + 1 (mod £). 

9.5.1. This follows from Lemma 9.5.2. 

9.5.2. (a) Each Kj \ acts on Vf \ via The dimension is 2 because Vf \ — 
e\Vi(Af) = e\(Kf ®q Q^) 2 = K 2 A . Apply each e\ to any linear dependence 
J2y e \ lV \' = 0 to show that the sum J] A Vf,\ is direct. The relation v = 
J2\ e \ v f° r an Y v € V«(j4/) shows that the sum spans V^(j4/). The GQ-action 
restricts to V/ ; a because it commutes with e\. 
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(b) The first part is explained at the end of Section 9.2. For the second 
part, pA f ,e is continuous, making Vg(Af) x Gq — ► continuous, and 

Vf t \ is a Q^-vector subspace of Ve(Af). 

9.6.1. (a) Since conj has order 2 the only possible eigenvalues are ±1. Recall 
that detp(conj) = — 1. 

(b) Irrcducibility immediately gives a with b ^ 0 and a' with c ^ 0. If 
neither a nor o 7 works then ctct' does. For the last part, conjugate by a matrix 
of the form 

9.6.4. (a) To verify that the map is an embedding check the orders of the 
elements by using characteristic polynomials to compute eigenvalues. This 
also shows that the two elements of GL 2 (F 3 ) generate a subgroup H of order 
divisible by 24. Since A 4 is the only order 12 subgroup of S4, SL 2 (F 3 ) is 
the only order 24 subgroup of GL 2 (F 3 ), but the second generator of H has 
determinant — 1. 

(b) The first statement follows from the end of part (a). Working modulo A, 
compute for p \ 3MjNe that 

a P (.f) = trp /A (Frob p ) = trp B3 (Frob p ) = a p (E). 

Since V = detp/^ is a lift of detp E3 = \3 (mod 3) from F 3 to it 
suffices to consider the latter character, detp E3 = X3 (mod 3). This sur- 
jects, making tp quadratic, and since as a Galois representation it is defined 
on Gal(Q(/Z3)/Q), as a Dirichlet character tp has conductor 3. 

(c) Use results in Chapter 4 to evaluate the leading coefficient of the Eisen- 
stein series. 

(d) Since ip(p) = p (mod A) the operators T p on Si(Mf,tjj) and T p 
on S2(r 0 (Mf)) are congruent modulo A. This observation and part (c) show 
that T p g = T p f = a p (f)f (mod A). On the one hand the right side is congru- 
ent modulo A to a p (E)g for all but finitely many p by parts (b) and (c), giving 
the first statement. On the other hand the right is also congruent modulo A 
to a p (g)g = ai(T p g)g for all p by part (c). The second statement follows. 

(e) For all but finitely many primes <j>(T p ) = a p (E) (mod A), and so T p — 
a p (E) e kei(f> — m, implying 4>'{T p ) — a p (E) e 4>'{m) C A'. 

(h) The argument in Exercise 9.6.1 applies over any field whose character- 
istic is not 2. Let L be a finite Galois extension of Q containing K g » and K g i . 
Use Strong Multiplicity One to show that if a e Gal(L/Q) then {g") a is the 
newform associated to (g') cr ; in particular if a fixes K ff / then it fixes K s ». 
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77 
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78 
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group), 206 
P X (K) (projective line over K), 256 
P 2 (k) (projective plane over k), 252 
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over K), 255 
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q (e 2 ' KlT where r € ~H, point of the 

punctured unit disk), 3 
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p x (Galois representation associated 
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pE,e (Galois representation associated 

to E), 383 
p/,A (Galois representation associated 
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Px 1 (N),e (Galois representation 

associated to Xi(iV)), 387 
p E e (mod I representation associated 

to E), 385 
Pf x (mod £ representation associated 

to /), 395 

S 0 (N) (moduli space for r 0 (N)), 37 
Si (AT) (algebraic moduli space), 309 
Si (JV) (moduli space for A (AT)), 38 
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Si(AT) a i g (algebraic moduli space 

for A (AO), 301 
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5(A) (cusp forms of all weights with 
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Sk(r) (cusp forms of weight k with 
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of W N ), 204 
S fc (A(A0) now (newforms at level AT), 
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S fc (A(A0) old (oldforms at level N), 188 
Sk(n) (feth power sum up to n - 1), 134 
5fc(SL 2 (Z)) (cusp forms of weight fc), 6 
S(AT) (moduli space for r(N)), 38 
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sums), 134 
5(SL 2 (Z)) (cusp forms of all weights), 6 
Cfc-i(w) (arithmetic function), 5 
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<7fc_i(n) (arithmetic function), 115 
<ja(z) (Weierstrass <r-function), 138 
a v (Frobenius map), 317 
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T p (Hecke operator, p prime), 169 
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to Af), 389 
Ta^(C) (€-adic Tate module of the curve 

xy = 1), 373 
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9 X (symmetrized theta function, 
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associated to a), 152 
$£(7) (theta function associated to v), 

151 

$(t, I) (/-dimensional theta function), 
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(theta function), 156 
T p (Hecke operator in characteristic p), 
349 

U m (n) (neighborhood in GL d (Z e )), 374 
t/ CT (F) (neighborhood in G Q ), 377 
U v (n) (neighborhood in Zf), 374 
U x (n) (neighborhood in Zi), 374 

v (row vector of order N in (Z/iVZ) 2 ), 
111 

Ve(Af) (£-adic vector space associated 

to Af), 390 
Ve(C) (i'-adic vector space associated to 

the curve xy = 1), 380 
Ve(E) (£-adic vector space associated 

to E), 383 
Ve(Xi(N)) (£-adic vector space 

associated to Xi(N), 388 
Vf (vector space associated to /), 242 
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to /), 391 
Vr (volume of X(r)), 181 

Wn (normalized kjjv operator), 204 
wn (involution), 43 

Xo(N) (compact modular curve for 

r 0 (N)), 58 
Xi(N,p) (compact modular curve), 176 
Xi y o(N,p) (modular curve), 306 
Xi(N) (compact modular curve for 

A (JV)). 58 
X\(N) (modular curve as nonsingular 
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Xo(N)ei s , Xi(N)a.i s (modular curves as 
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Xi(iV)P^ nar (planar model of the 

modular curve), 291 
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the universal elliptic curve), 283 
X(r) (compact modular curve), 58 
X(N) (compact modular curve for 

r(N)), 58 
£(s) (symmetrized zeta function), 121 

Yo(N) (noncompact modular curve for 

r 0 (N)), 38 
Yi(N) (noncompact modular curve for 

A (AO), 38 
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the universal elliptic curve), 283 
Y(r) (noncompact modular curve), 38 
Y(N) (noncompact modular curve for 

J\A0), 38 
Yi(N,p) (noncompact modular curve), 

43 

Zt (£-adic integers), 373 
Za(z) (Weierstrass zeta function), 138 
Z( p ) (localization of Z at p), 334 
Z (ring of algebraic integers), 231 
Z( P ) (localization of Z at p), 329 
Of (identity element of an elliptic 

curve), 252 
C+(&) (modified zeta function), 113 
C+(fe, /u) (modified zeta function), 114 
£ " (k) (modified zeta function), 115 
£(fc) (Riemann zeta function), 4 
C(s, r) (Hurwitz zeta function), 135 
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algebraic, 254, 312 

geometric, 253, 312 
adjoint, 183 

admissible change of variable, 251, 311 
affine plane, 252 
algebraic closure of a field, 250 
algebraic curve 

affine, 258 

nonsingular, 258 
algebraic curves 

isomorphic, 263 

isomorphic over k, 264 
algebraic element over a field, 250 
algebraic field extension, 250 
algebraic integer, 231 
algebraic number, 230 
algebraically closed, 231 
algebraically closed field, 250 
Artin's Conjecture, 395 
automorphic form of weight k, 72 

base point, 214 
Bernoulli number, 9, 134 

of V, 135 
Bernoulli polynomial, 134 
Bezout's Theorem, 252 
birational equivalence of algebraic 
curves, 264 



Birch and Swinnerton-Dyer Conjecture, 
362 

canonical divisor on a Riemann surface, 
84 

commensurable congruence subgroups, 
164 

compatibility condition for meromor- 
phic differentials on a Riemann 
surface, 78 
complex multiplication, 30 
complex torus, 25 
conductor 

of a Dirichlet character, 117 
of a normalized eigenform, 198 
of an elliptic curve 
algebraic, 324 
analytic, 292 
congruence subgroup, 13 
conjugate function fields, 264 
coordinate ring of an algebraic curve, 
257 

Cubic Reciprocity Theorem, 160 
Curves-Fields Correspondence 

Part 1, 264 

Part 2, 265 
cusp form of weight k, 6 

with respect to a congruence 
subgroup, 17 
cusp of a modular curve, 58 

irregular, 89 

regular, 89 
cusp of an elliptic curve, 314 
cyclic quotient isogeny, 27 
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decomposition group, 369 

absolute, 377 
decomposition index, 367 
Dedekind eta function, 19 
degree 

of a map of Riemann surfaces, 65 
of a morphism 

inseparable, 320 

separable, 320 
of a morphism of algebraic curves, 
269 

degree-0 divisors of an algebraic curve, 
270 

desingularizing an algebraic curve, 264 
Dirichlet //-function, 121 
Dirichlet character 

modulo N, 116 

primitive, 117 

trivial, 116 
Dirichlet 's Theorem on Arithmetic 

Progressions, 370 
discriminant 

function on H, 6 

of a quadratic number field, 367 

of a Weierstrass equation, 250, 310 
divisor group of an algebraic curve, 269 
divisor on a Riemann surface, 83 
double coset, 103, 164 
double coset operator 

as map of Picard groups, 227 

on Jacobians, 228 
dual isogeny, 28 

Eichler-Shimura Relation, 354 
eigenform, 186, 195 

normalized, 195 
eigenspace of A4fc(A(-A")), 119 
Eisenstein series of weight k, 4 

for a lattice, 32 

normalized, 10 
Eisenstein series of weight 2, 18 

normalized, 19 
i?-adic cyclotomic character, 379 
i?-adic integer, 373 
i?-adic Tate module 

associated to Af, 389 

of a modular curve, 386 

of an elliptic curve, 382 

of the curve xy = 1, 373 



elliptic curve 
complex, 35 
ordinary, 313 

over a field of characteristic 0, 251 
over an arbitrary field, 310 
supersingular, 313 
universal, 282, 311 
elliptic point of a congruence subgroup, 
48 

enhanced elliptic curve 
for r(N), 38 
for A (AT), 37 
for /o (TV), 37 
for A (AT) 
algebraic, 301 
complex algebraic, 301 
equivalence of Galois representations, 

379, 380 
Euler product, 200 
exceptional points of a map, 65 

factor of automorphy, 14 
Faltings's Isogeny Theorem, 362 
Fermat's Last Theorem, 396 
field extension 

inseparable, 319 

purely inseparable, 319 

separable, 319 
Font aine-Mazur-Langlands Conj ecture , 
395 

forward change of variable formula, 218 
forward map 

of Jacobians, 218 

of Picard groups, 220 
four squares problem, 11 
Fourier transform, 9, 143 

finite, 150 
free Q-module on the points of a 

modular curve, 86 
Frobenius element 

absolute, 377 

of a Galois group, 369 
Frobenius map 

on an algebraic curve, 318 

on F p , 317 
function field of an algebraic curve, 257 
function field over k, 264 
functional equation 

of A 120 
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of L(sJ), 204 
fundamental domain for SL2(Z), 54 

Galois representation 

^-adic, 378, 380 

associated to Af, 389, 390 

associated to /, 390, 391 

associated to Xi(iV), 387, 388 

associated to an elliptic curve, 383 

geometric, 395 

modular, 391, 395 

odd, 393 

unramified, 380 
gamma function, 120 
Gauss sum of a Dirichlet character, 118 
generating function, 11, 134, 179 
genus, 65 

of a modular curve, 62 
genus formula, 68 
global minimal discriminant of an 
elliptic curve, 323 

Hecke algebra 

over C, 234 

over Z, 234 
Hecke operator (d) (diamond operator), 
169 

Hecke operator T p , 169 

four compatible notions of, 175 
Hecke operators on Jacobians, 229 
Hilbert Basis Theorem, 266 
holomorphic at oo, 3, 16 
homology group of a Riemann surface, 
213 

Hurwitz zeta function, 135 
hyperbolic measure on H, 181 

Igusa's Theorem, 348 
inertia group, 369 

absolute, 377 
integrally closed, 232 
invariant of a Weierstrass equation, 250, 
310 

inverse limit, 373 

inverse Mellin transform, 147 

isogeny, 27, 244 

isomorphism of algebraic curves, 263 
isotropy subgroup, 48 



Jacobian 

of a modular curve, 227 
of a Riemann surface, 214 

K-points off, 254 
Kronecker symbol, 367 
Krull Intersection Theorem, 342 
Krull topology, 377 

i-function 

of a modular form, 200 

of an elliptic curve (Hasse-Weil), 361 
A-adic integers, 375 
A-adic numbers, 375 
lattice in C, 25 
Legendre relation, 138 
level of a congruence subgroup, 13 
linear space of a divisor on a Riemann 

surface, 84 
Lipschitz formula, 147 
local ring 

of an algebraic curve at a point, 259 

of the Hecke ring, 239 
localization 

of Z at p, 329 

of Z at p, 334 

maximal ideal of the local ring at a 

point, 259 
Mellin transform, 136, 145, 204 
meromorphic differential on a Riemann 

surface, 78 
Mobius function, 113 
mod I representation, 385 

modular, 395 
modular curve, 38 
modular form of weight k, 4 

with respect to a congruence 
subgroup, 17 
modular function, 7 
modular group, 1 
modular invariant, 7 
modular parameterization of an elliptic 

curve, 63 
Modularity Theorem 

strong Version Aq , 362 

strong Version R, 392 

Version A c , 246 

Version a p , 356 
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Version Jc, 215 

Version Jq, 293 

Version L, 362 

Version R, 392 

Version Xc , 63 

Version X Q , 292 
moduli space, 38 

for A (AT) 
algebraic, 301 
complex algebraic, 301 
Mordell-Weil Theorem, 254 
morphism 

inseparable, 319 

of algebraic curves, 263 
defined over a field, 264 

purely inseparable, 319 

separable, 319 
Multiplicity One property of newforms, 
196 

multiply-by-integer isogeny, 27 

n-dimensional projective space over K, 
255 

iV-torsion subgroup 

of a complex torus, 30 

of an elliptic curve, 254 
Nakayama's Lemma, 235 
new, 173 
newform, 195 
newforms at level N, 188 
node of an elliptic curve, 314 
Noetherian ring, 266 
nonsingular, geometrically, 251 
norm map of function fields, 219 
normal linear operator, 183 
Nth division polynomials of an elliptic 

curve, 254 
number field, 231 

of a modular form, 234 
number ring of a field, 232 

old, 173 

oldforms at level N, 187 
orthogonality relations, 117 
modified versions, 118 

p-adic valuation, 322 
perfect field, 321 



perfect pairing, 243 
period of an elliptic point, 49 
Petersson inner product, 182 
Picard group 

of a Riemann surface, 214 

of an algebraic curve, 270 
Poisson summation formula, 9, 144 
polynomial function on an algebraic 

curve, 257 
power sum, 133 
prime subfield of a field, 312 
principal congruence subgroup of 

level N, 13 
principal divisor, 270 
profinite, 374 
projective line, 256 
projective plane over k, 252 
pullback 

of function fields, 217 

of function fields of algebraic curves, 
265 

of holomorphic differentials, 218 
of meromorphic differentials, 77 

ramification degree, 367 

of a map of Riemann surfaces, 65 
of a morphism at a point, 269 
ramified prime in a number field, 367 
rational function defined over a field, 
263 

rational function on an algebraic curve, 
257 

rational integer, 231 

rational map of elliptic curves, 264 

reduction of a projective algebraic curve 

over Q at p, 338 
reduction of an affine algebraic curve 

over Q at p, 337 
reduction of an elliptic curve over Q 
bad, 323 
additive, 323 
multiplicative, 323 
good, 323 
ordinary, 323 
supersingular, 323 
representation number, 11 
residue degree, 367 
residue field, 367 

reverse change of variable formula, 221 
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of Jacobians, 222 

of Picard groups, 224 
Riemann surface, 25, 47 
Riemann zeta function, 4, 121 
Riemann-Hurwitz formula, 66 
Riemann-Roch Theorem, 84 

Schwartz function, 151 
Serre's Conjecture, 396 
Strong Multiplicity One property of 

newforms, 198 
Sylvester matrix, 232 

tangent line to an algebraic curve, 259 
Tchebotarov Density Theorem, 371 
tensor product of a finitely generated 

Abelian group and a field, 240 
theta function, 143 

trace map of holomorphic differentials, 
221 

transcendental element over a field, 250 

uniformizer at a point, 260 
upper half plane, 2 



valuation at a point, 261 
variety, 293 

weakly modular function of weight k, 2 

with respect to a congruence 
subgroup, 14 
Weierstrass u-function, 138 
Weierstrass p- function, 31 
Weierstrass equation 

p-integral, 330 

p-minimal, 331 

Doming form, 331 

global minimal, 323 

Legendre form, 331 

nonsingular, 251, 310 

over a field of characteristic 0, 250 

over an arbitrary field, 310 
Weierstrass polynomial, 251, 311 
Weierstrass zeta function, 138 
weight- k double coset operator 

on divisor groups, 167 

on modular forms, 165 
weight-fc operator, 14 
Weil pairing, 30, 275 
width of a cusp, 59 
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